The systematization which occurs in a theoretical science, as we
may properly call it in order to distinguish it from a natural or an
applied science, is a process which is apt to come late in the development
of  a  subject.  Evidently  some  fields  of  knowledge  are  hardly 
ready  for  it,  for  it  is  typified  by  a  free  spirit  of  making  hypotheses 
and  definitions  rather  than  a  mere  recognition  of  facts.  But  when  we 
find this feeling for hypothesis and definition and, in addition, become
involved in chains of deductive reasoning, we are driven to a
characteristic  method  of  construction  and  analysis  which  we  may  call 
the  mathematical  method.  It  is  not  a  question  as  to  whether  mathematics 
is  desirable  or  not  in  such  a  subject.  We  are  in  fact  forced  to  adopt 
the  mathematical  method  as  a  condition  of  further  progress. 
PREFACE TO THE REVISED EDITION


The  first  edition  of  this  book  has,  during  its  short  life  of  some  half  a  dozen 
years,  been  very  widely  used  in  colleges,  universities,  and  teachers’  summer 
institutes.  It  is  a  comfort  to  the  authors  that  the  continued  and  growing 
demand  for  the  book  has  warranted  the  production  of  a  revised  edition. 
Although  the  revised  edition  remains  basically  like  the  first  one,  a  large 
number  of  small  changes  and  improvements  have  been  made,  some  of  the 
passages  have  been  brought  up  to  date,  an  additional  section  for  Chapter  1 
was  written,  several  significant  new  problems  and  four  pertinent  appendix 
topics  have  been  added,  and  the  bibliography  has  been  augmented  with  a 
number  of  important  recent  titles. 

With some urging, but also with some reluctance, the term “the
mathematical method” has been changed in the revised edition to “the
axiomatic method.” Though history and affection clearly support the former
term,  it  must  be  admitted  that  the  latter  one  is  more  descriptive  and  more 
democratic. 

Many  of  the  small  improvements  incorporated  in  the  revised  edition 
are  the  result  of  constructive  suggestions  made  by  interested  users  of  the 
first  edition;  the  authors  are  very  grateful  to  these  persons.  Particular  thanks, 
along  this  line,  go  to  Professor  Clayton  W.  Dodge  of  the  University  of  Maine, 
whose  counsel  in  the  preparation  of  the  revised  edition  has  been  invaluable. 

Howard  Eves  and  Carroll  V.  Newsom 


Stillwater, Maine, and
Candlewood Isle, Connecticut
January  1965 
PREFACE TO THE FIRST EDITION


J. W. N. Sullivan, writing in the Atlantic Monthly for March, 1933, stated,
“It has become evident that, so far as the science of physics is concerned,
we do not require to know the nature of the entities we discuss, but only their
mathematical structure. And, in truth, that is all we do know.” A similar
recognition of the role of mathematics in virtually all the systematic disciplines
is now common among scholars. In fact, the distinguished Harvard
Committee of 1945, author of the report entitled General Education in a
Free Society, emphasized such a role in the assertion, “Mathematics may be
defined as the science of abstract form . . . . The discernment of structure is
essential no less to the appreciation of a painting or a symphony than to
understanding the behavior of a physical system; no less in economics than in
astronomy. Mathematics studies order abstracted from the particular objects
and phenomena which exhibit it, and in a generalized form.”

There  is  little  doubt  that  man’s  rapid  progress  in  recent  decades  in  the 
control  and  understanding  of  nature,  in  providing  himself  abstract  tools  of 
creation  that  border  upon  the  miraculous,  and  in  his  actual  comprehension 
of the powers and limitations of the human mind is a direct consequence of
mathematical  triumphs  of  the  last  few  centuries,  especially  the  nineteenth 
and  twentieth.  Thus  scholarly  endeavor  in  virtually  all  areas  of  human 
activity  requires  to  an  increasing  extent  a  knowledge  of  mathematics  and  the 
ability  to  use  it;  except  for  the  study  of  language,  mathematics  may  well  be 
the most basic component of a so-called general education.

Yet  even  students  of  mathematics  are  usually  denied  until  their  most 
advanced  years  of  study  an  understanding  of  the  meaning  and  nature  of 
mathematics. They labor under false definitions and impressions, which,
unfortunately,  continue  to  be  promulgated  in  most  elementary  courses.  Until 
they  have  reached  the  graduate  level  students  generally  have  heard  little  of 
mathematical  structure,  to  which  Sullivan  referred;  they  have  virtually  no 
acquaintance  with  the  common  collections  of  axioms,  with  the  postulational 
method,  and  with  the  nature  and  use  of  mathematical  systems  and  models; 
it  is  probable  that  they  have  little  knowledge  of  the  real  number  system 
and  have  had  little  more  than  a  superficial  experience  in  working  with  such  a 
fundamental  concept  as  set. 

This  book  has  been  written,  therefore,  in  an  attempt  to  make  available 
to  advanced  undergraduate  students  an  introductory  treatment  of  the 
foundations  of  mathematics  and  of  concepts  that  are  basic  to  mathematical 
knowledge.  A  course  of  study  utilizing  this  work  as  a  text  would,  in  the 
opinion  of  the  authors,  rectify  a  great  deal  of  the  curricular  deficiency 
described  above.  The  contents  were  selected  with  considerable  care  in  order 
that  the  exposition  might  have  value  not  only  to  mathematical  scholars  and 
scientists  but  also  to  philosophers,  historians,  and  others;  it  is  especially  to  be 
hoped  that  potential  teachers  of  mathematics  will  have  an  opportunity  to 
study  under  competent  instruction  such  material  as  this  book  contains. 

The  treatment  is  strongly  historical,  for  the  study  is  concerned  with 
fundamental  mathematical  ideas;  a  genuine  understanding  of  ideas  is  not 
possible  without  an  analysis  of  origins.  Even  the  order  of  topics,  as  revealed 
by the table of contents, provides in a rough way a chronological development
of the basic concepts that have made mathematics what it is today.
Obviously,  the  treatment  at  many  points  is  far  from  exhaustive,  for  the 
exposition  has  been  designed  to  be  compatible  with  the  level  of  maturity  and 
understanding  of  the  advanced  undergraduate  student;  the  bibliography 
contains  many  suggestions  for  those  who  desire  to  learn  more  about  a 
particular  topic. 

The  material  comprising  the  book  has  been  successfully  given  in  college 
and  university  courses  by  each  of  the  authors,  and  both  feel  that  it  is 
eminently suitable for undergraduate courses bearing such titles as Foundations
of Mathematics, Fundamental Concepts of Mathematics, or Fundamental Concepts
of Algebra and Geometry. The material is elastic and can be adapted to courses
of varying lengths simply by spending more or less time on detailed demonstrations
and problems; in this way the material has been utilized in trimester,
semester,  and  year  courses.  All  problems  requiring  a  knowledge  of  the 
calculus  are  so  marked. 

The  book  will  have  accomplished  its  purpose  for  many  readers  if  it 
does  no  more  for  them  than  explain  the  nature  of  geometric  and  algebraic 
systems.  It  is  the  hope  of  the  authors,  however,  that  their  efforts  may  be 
much  more  rewarding;  perhaps  some  students  of  mathematics  will  for  the 
first  time  see  the  forest  without  becoming  confused  by  the  trees,  and  a  few 
students may obtain from the work a glimpse of the meaning and opportunities
of mathematical creation, the finest testimony to man’s inherent genius.

It is a pleasure to thank The Cambridge University Press, The Macmillan
Company, McGraw-Hill Book Company, Inc., Mrs. R. E. Moritz,
and  Mrs.  C.  J.  Keyser  for  graciously  granting  us  permission  to  quote  from 
certain  works.  Much  of  the  task  of  writing  the  book  was  made  easier  by  the 
understanding  and  cooperation  of  Professor  Spofford  H.  Kimball,  chairman 
of  the  Department  of  Mathematics  and  Astronomy  at  the  University  of 
Maine.  A  large  part  of  the  manuscript  was  read  and  critically  discussed  from 
the  philosophical  point  of  view  by  Dr.  Charles  G.  Werner  of  the  University 
of  Miami.  An  undoubted  debt  is  owed  to  Professor  Raymond  L.  Wilder  of 
the  University  of  Michigan,  with  whom  one  of  the  authors  first  studied  such 
matters  as  are  contained  herein.  The  conscientious  typing  of  the  manuscript 
was  undertaken  by  Mrs.  G.  Robert  Butler. 

Howard  Eves  and  Carroll  V.  Newsom 

Stillwater, Maine, and
Candlewood Isle, Connecticut
October  1957 
CHAPTER ONE

MATHEMATICS  BEFORE  EUCLID 


1.1 The Empirical Nature of Pre-Hellenistic Mathematics

The  thesis  can  be  advanced  that  mathematics  arose  from  necessity. 
The  annual  inundation  of  the  Nile  Valley,  for  example,  forced  the  Egyptians 
to  develop  some  system  for  redetermining  land  markings;  in  fact,  the  word 
geometry means “measurement of the earth.” The need for mensuration formulas
was especially imperative if, as Herodotus remarked, taxes in Egypt
were  paid  on  the  basis  of  land  area.  The  Babylonians  likewise  encountered  an 
urgent  need  for  mathematics  in  the  construction  of  the  great  engineering 
structures  for  which  they  were  famous.  Marsh  drainage,  irrigation,  and  flood 
control  made  it  possible  to  convert  the  land  along  the  Tigris  and  Euphrates 
rivers  into  a  rich  agricultural  region.  Similar  undertakings  undoubtedly 
occurred  in  early  times  in  south-central  Asia  along  the  Indus  and  Ganges 
rivers,  and  in  eastern  Asia  along  the  Hwang  Ho  and  the  Yangtze.  The 
engineering,  financing,  and  administration  of  such  projects  required  the 
development of considerable technical knowledge and its attendant mathematics.
Again, a usable calendar had to be computed to serve agricultural
needs. Finally, the demand for some system of uniformity in barter was
present in even the earliest civilizations; this fact also furnished a pronounced
stimulus  to  mathematical  development.* 

Thus  there  is  a  basis  for  saying  that  mathematics,  beyond  that  implied 
by  primitive  counting,  originated  during  the  period  of  the  fifth,  fourth,  and 
third millennia B.C. in certain areas of the ancient orient as a practical science
to assist in engineering, agricultural, and business pursuits. Although the
initial emphasis was on mensuration and practical arithmetic, it was natural
that a special craft should come into being for the application, instruction,
and development of the science, and that, in turn, tendencies toward abstraction
should then assert themselves and the subject be studied, to some extent,
for  its  own  sake.  In  this  way  a  basis  for  the  beginnings  of  theoretical  geometry 
grew  out  of  mensuration,  and  the  first  traces  of  elementary  algebra  evolved 
from  practical  arithmetic. 

In  our  study  of  early  mathematics  we  are  restricted  essentially  to  that 
of  Egypt  and  Babylonia.  The  ancient  Egyptians  recorded  their  work  on  stone 
and  papyrus,  the  latter  fortunately  enduring  because  of  Egypt’s  unusually 
dry climate; the Babylonians used imperishable baked clay tablets. In contrast
to the use of these media, the early Indians and Chinese used very
perishable  writing  materials  like  bark  and  bamboo.  Thus  it  has  come  to  pass 
that  we  have  a  fair  quantity  of  definite  information,  obtained  from  primary 
sources,  about  the  science  and  the  mathematics  of  ancient  Egypt  and 
Babylonia,  while  we  know  very  little  indeed,  with  any  degree  of  certainty, 
about  these  fields  of  study  in  ancient  India  and  China. 

It  is  the  nature,  rather  than  the  content,  of  this  early  pre-Hellenistic 
mathematics  that  concerns  us  here,  and  in  this  regard  it  is  important  to  note 
that,  outside  of  very  simple  considerations,  the  mathematical  relations 
employed  by  the  Egyptians  and  by  the  Babylonians  resulted  essentially  from 
“trial and error” methods. In other words, to a great extent the earliest
mathematics was little more than a practically workable empiricism: a collection
of rule-of-thumb procedures that gave results of sufficient acceptability
for  the  simple  needs  of  those  early  civilizations.  Thus  the  Egyptian  and 
Babylonian  formulas  for  volumes  of  granaries  and  areas  of  land  were  arrived 
at  by  trial  and  error,  with  the  result  that  many  of  these  formulas  are  definitely 
faulty. For example, an Egyptian formula for finding the area of a circle was
to take the square of eight ninths of the circle’s diameter. This is not correct,
as it is equivalent to taking π = (4/3)⁴ = 3.1604 . . . . The even less accurate
value of π = 3 is implied by some Babylonian formulas.† Another incorrect
formula found in ancient Babylonian mathematics is one which says that the

*There is an alternative thesis that finds the origin of mathematics in religious ritual,
with agriculture, trade, and surveying being later contributors. See A. Seidenberg [1]
and [2]. (References by author’s name only are to the Bibliography at the end of
the book.)

†This value for π is also found in the Bible; see I Kings 7:23, and II Chron. 4:2.
volume of a frustum of a cone or of a square pyramid is given by the product
of the altitude and half the sum of the bases. It seems that the Babylonians
also used, for the area of a quadrilateral having a, b, c, d for its consecutive
sides, the incorrect formula K = (a + c)(b + d)/4. This formula gives the
correct result only if the quadrilateral is a rectangle; in every other instance
the formula gives too large an answer. It is curious that this same incorrect
formula was reproduced 2000 years later in an Egyptian inscription found
in the tomb of Ptolemy XI, who died in 51 B.C.!
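
As a rough numerical check on the formulas just described, the following sketch compares two of the empirical rules with exact values. The function names are illustrative assumptions, and the parallelogram with sides 3 and 4 is simply one convenient quadrilateral that can be made rectangular or slanted.

```python
import math

def egyptian_circle_area(diameter):
    """Egyptian rule: square eight ninths of the diameter."""
    return (8 * diameter / 9) ** 2

def implied_pi():
    """Value of pi for which the Egyptian rule would be exact.

    Setting (8d/9)^2 = pi * d^2 / 4 gives pi = 256/81 = (4/3)^4.
    """
    return 4 * (8 / 9) ** 2

def quadrilateral_rule(a, b, c, d):
    """Babylonian rule K = (a + c)(b + d)/4 for consecutive sides a, b, c, d."""
    return (a + c) * (b + d) / 4

def parallelogram_area(a, b, angle_degrees):
    """Exact area of a parallelogram with sides a, b and the given included angle."""
    return a * b * math.sin(math.radians(angle_degrees))

print(implied_pi(), math.pi)             # the implied value against the true one
print(quadrilateral_rule(3, 4, 3, 4))    # the rule applied to sides 3, 4, 3, 4
print(parallelogram_area(3, 4, 90))      # the rectangle, where the rule is exact
print(parallelogram_area(3, 4, 60))      # a slanted case, where the rule is too large
```

The rule for the quadrilateral uses only the side lengths, so it cannot distinguish the rectangle from the slanted parallelogram, although their areas differ.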

In general, simple empirical reasoning may be described as the formulation
of conclusions based upon experience or observation; no real understanding
is involved, and the logical element does not appear. Empirical
reasoning often entails stodgy fiddling with special cases, observation of
coincidences, experience at good guessing, considerable experimentation, and
flashes of intuition. Perhaps a very simple hypothetical illustration of empirical
reasoning might clarify what is meant by this type of procedure.

Suppose  a  farmer  wishes  to  enclose  with  200  feet  of  fencing  a  rectangular 
field  of  greatest  possible  area  along  a  straight  river  bank,  no  fencing  being 
required  along  the  river  side  of  the  field.  If  we  designate  as  the  depth  of  the 
field  the  dimension  of  the  field  perpendicular  to  the  river  bank,  and  as  the 
length  of  the  field  the  dimension  parallel  to  the  river  bank  (see  Figure  1), 
FIG. 1

the  farmer  could  soon  form  the  following  table: 


depth        length       area in
in feet      in feet      square feet

  10           180           1800
  20           160           3200
  30           140           4200
  40           120           4800
  50           100           5000
  60            80           4800
  70            60           4200
  80            40           3200
  90            20           1800
Examination of the table shows that the maximum area recorded occurs
when  the  depth  is  50  feet  and  the  length  is  100  feet.  The  interested  farmer 
might  now  try  various  depths  close  to,  but  on  each  side  of,  50,  and  would 
perhaps  make  the  following  additional  table: 


depth        length       area

  48           104           4992
  49           102           4998
  50           100           5000
  51            98           4998
  52            96           4992

By  now  the  farmer  would  feel  quite  certain  that  the  maximum  area  is 
obtained  when  the  depth  is  50  feet  and  the  length  is  100  feet;  that  is,  he  would 
accept  the  proposition  that  the  maximum  area  occurs  when  the  length  of  the  field 
is  twice  the  depth  of  the  field.  A  further  strengthening  of  this  belief  would  result 
from  his  examination  of  the  symmetry  observed  in  his  table,  and  he  would 
no doubt use his conjecture and pass it along to others as a reliable mathematical
fact. Of course the farmer’s conclusion is by no means established,
and  no  present-day  student  of  mathematics  would  be  permitted  to  “prove” 
the  conjecture  in  this  fashion.  Shrewd  guessing  has  taken  the  place  of 
deductive  logic;  patience  has  replaced  brilliance. 
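
The farmer's tabulation can be reproduced mechanically, and it is instructive to set it beside a deductive argument. The sketch below assumes, as in the text, 200 feet of fencing and a field of depth d and length 200 - 2d; the function names are illustrative. Completing the square gives d(200 - 2d) = 5000 - 2(d - 50)^2, which shows that no depth whatever, tabulated or not, can yield more than 5000 square feet.

```python
FENCING = 200  # feet of fencing, as in the text

def length(depth):
    """Length parallel to the river when two sides of the given depth are fenced."""
    return FENCING - 2 * depth

def area(depth):
    """Area of the field for a given depth."""
    return depth * length(depth)

def table(depths):
    """Rows (depth, length, area), as the farmer might record them."""
    return [(d, length(d), area(d)) for d in depths]

for row in table(range(10, 100, 10)):
    print(row)

# The empirical step: pick the best depth among those actually tried.
best = max(range(1, 100), key=area)

# The deductive step rests on the identity area(d) = 5000 - 2*(d - 50)**2.
# The loop below only spot-checks it; the algebra establishes it for every d.
assert all(area(d) == 5000 - 2 * (d - 50) ** 2 for d in range(0, 101))
```

The search over tried depths corresponds to the farmer's patience; the identity corresponds to the proof he lacks.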

In  spite  of  the  empirical  nature  of  ancient  oriental  mathematics,  with 
its  complete  neglect  of  proof  and  the  seemingly  little  attention  paid  to  the 
difference  between  exact  and  approximate  truth,  one  is  nevertheless  struck 
by the extent and the diversity of the problems successfully attacked. Particularly
has this become evident in recent years with the scholarly deciphering
of many Babylonian mathematical tablets. Apparently a great deal of
elementary  mathematical  truth  can  be  discovered  by  empirical  methods 
when  supplemented  by  extensive  experimentation  carried  on  patiently  over 
a  long  period  of  time. 

How  were  the  mathematical  findings  of  the  ancient  orient  stated?  Here 
we  must  rely  upon  such  primary  sources  as  the  Rhind,  the  Moscow,  and  other 
Egyptian mathematical papyri, and upon the approximately three hundred
Babylonian  mathematical  tablets  that  have  so  far  been  deciphered. 

The  Rhind,  or  Ahmes,  papyrus  is  a  mathematical  text  dating  from  about 
1650 B.C. Partaking of the nature of a practical handbook, it contains 85
problems  copied  by  the  scribe  Ahmes  from  a  still  earlier  work.  Now  possessed 
by  the  British  Museum,  it  was  originally  purchased  in  Egypt  by  the  English 
Egyptologist,  A.  Henry  Rhind.  This  papyrus  and  the  somewhat  older 
Moscow  papyrus,  a  similar  mathematical  text  containing  25  problems, 
constitute our chief sources of information concerning ancient Egyptian
mathematics.  All  of  the  110  problems  found  in  these  papyri  are  numerical, 
and  many  of  them  are  very  easy.  In  general,  each  problem  is  first  formulated 
and  then  followed  by  a  step-by-step  solution  using  the  special  numbers  given 
at  the  beginning.  Although  special  numbers  are  employed  in  this  fashion, 
one  feels  that  they  are  incidental  and  are  being  used  merely  to  illustrate  a 
general  procedure.  Many  of  the  problems  require  nothing  more  than  a  simple 
linear  equation,  and  are  generally  solved  by  the  method  known  later  in 
Europe  as  the  rule  of  false  position.  This  rule  clearly  reflects  the  empirical 
nature  of  the  mathematical  procedures  of  the  time.  As  an  example,  suppose 
we are to solve the simple equation x + (x/5) = 24. Assume any convenient
value for x, say, x = 5. Then x + (x/5) = 6, instead of 24. Since 6 must be
multiplied by 4 to give the required 24, the correct value of x must be 4(5),
or 20.
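
The rule of false position as just described can be sketched in a few lines. The function name and signature below are illustrative assumptions; the sketch also assumes that the left-hand side is proportional to x, which is what makes a single scaling of the guess sufficient.

```python
from fractions import Fraction

def false_position(expression, target, guess):
    """Solve expression(x) = target when expression(x) is proportional to x.

    A convenient guess is tried, and the guess is then scaled by the
    ratio of the required value to the value actually obtained.
    """
    guess = Fraction(guess)
    obtained = expression(guess)
    return guess * Fraction(target) / obtained

# The example from the text: x + x/5 = 24, starting from the guess x = 5.
x = false_position(lambda x: x + x / 5, 24, 5)
print(x)  # 20
```

Any nonzero guess leads to the same answer; the guess 5 is convenient only because it clears the fraction.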

The  Babylonian  mathematical  tablets  are  of  two  types,  table  texts  and 
problem  texts.  There  must  be  at  least  500,000  Babylonian  tablets  now  scattered 
among  various  museums  of  the  world;  of  these  only  about  100  problem  texts, 
and  somewhat  more  than  twice  this  number  of  table  texts,  are  known  to  us. 
The  table  texts  exhibit  a  wide  variety  of  mathematical  tables,  such  as 
multiplication tables, tables of reciprocals (for reducing division to multiplication),
tables of squares and square roots and cubes and cube roots, tables
of  sums  of  squares  and  cubes  (for  solving  certain  types  of  cubic  equations), 
exponential  tables  (for  computing  compound  interest),  and  many  others. 
The  ancient  Babylonians  were  indefatigable  table  makers,  as  one  might  have 
expected,  for  the  construction  of  tables  is  indispensable  to  empirical  procedure. 
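
To suggest how a table can stand in for a solution method, the sketch below builds a table of n^3 + n^2 and solves an equation of the form x^3 + x^2 = c by lookup. The extent of the table and the sample value 252 are assumptions chosen for illustration and are not taken from any particular tablet.

```python
def cube_plus_square_table(limit):
    """Map n**3 + n**2 back to n for n = 1, ..., limit."""
    return {n ** 3 + n ** 2: n for n in range(1, limit + 1)}

TABLE = cube_plus_square_table(30)

def solve_by_lookup(c):
    """Return n with n**3 + n**2 == c if c appears in the table, else None."""
    return TABLE.get(c)

print(solve_by_lookup(252))  # 6, since 216 + 36 = 252
```

Nothing is derived here; the answer is found only because someone has already tabulated it, which is the sense in which tables serve an empirical procedure.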

The  problem  texts  also  show  considerable  variety  and  are  all  more  or 
less  concerned  with  the  formulation  and  solution  of  algebraic  and  geometric 
problems.  A  large  group  of  the  problem  texts,  like  the  Egyptian  papyri 
considered  above,  formulate  a  problem  in  terms  of  specific  numbers  and  then 
proceed  with  a  step-by-step  solution  using  the  specific  numbers.  Such  texts 
often  terminate  with  the  phrase,  “such  is  the  procedure.”  Again  it  is  apparent 
that it is the general procedure, and not the numerical result, that is considered
important. If, in a multiplication, a factor has the value 1, multiplication
by this 1 will be explicitly performed, for this step is necessary in the
general  case.  The  remaining  problem  texts  contain  on  a  single  tablet,  often 
not  as  large  as  a  page  of  this  book,  a  large  number  of  related  numerical 
problems  carefully  arranged  from  the  simplest  cases  up  through  the  more 
complicated  ones.  The  apparent  purpose  of  such  a  text  was  to  teach,  by 
repetition  and  gradual  introduction  of  complexities,  a  certain  method  or 
procedure, and the accompanying numbers serve merely as a guide to illustrate
the underlying general procedure. The solution of quadratic equations,
for  example,  both  by  general  formula  and  by  the  method  of  completing  the 
square,  is  explained  in  this  way  on  ancient  Babylonian  tablets. 
In summary, then, we find that pre-Hellenistic mathematics was
empirical. Nowhere do we find in ancient oriental mathematics a single
instance of what we today call a logical demonstration. Instead of an argument
we find a description of a process explained by means of specific numerical
cases. In short, we are instructed to “Do thus and so.” It is very interesting
to note that although today confirmed students of the scientific method
find this “Do thus and so” procedure highly unsatisfactory, it is the procedure
employed in much of our elementary teaching.

1.2  Induction  versus  Deduction 

Empirical  conclusions,  we  have  seen,  are  generalizations  based  upon  a 
limited  number  of  observations  or  experiments.  For  example,  the  farmer  of 
the  previous  section  obtained  a  general  rule  by  observing  a  limited  number  of 
computed  areas.  Another  farmer  may  observe  that  unusually  good  crops 
have  followed  a  number  of  winters  of  heavy  snow,  and  empirically  conclude 
that  snowy  winters  are  beneficial  to  crops.  As  a  further  example,  a  scientist 
may  observe  that  particularly  fine  displays  of  the  aurora  borealis  always 
occurred  in  his  experience  during  periods  of  pronounced  sun-spot  activity, 
and  conclude  that  there  must  be  a  connection  between  the  two  phenomena. 
This  type  of  reasoning,  which  concludes  on  the  basis  of  a  limited  number 
of instances that something is always true, is known as induction.* Modern
probability considerations have served to introduce refinements into inductive
procedures. It is important to note, however, that no matter how fully
the  conclusions  of  inductive  reasoning  may  seem  warranted  by  the  facts, 
these  conclusions  are  not  established  beyond  all  possible  doubt;  conclusions 
obtained  by  induction  are  only  more  or  less  probable. 

Empirical  conclusions  are  sometimes  reached  by  using  a  primitive  form 
of  induction  known  as  reasoning  by  analogy.  For  example,  if  we  cut  off  the 
top  of  a  triangle  by  a  line  parallel  to  the  base  of  the  triangle,  a  trapezoid  will 
remain,  and  the  area  of  a  trapezoid  is  given  by  the  product  of  its  altitude 
and  the  arithmetic  average  of  its  two  bases.  Now,  if  we  cut  off  the  top  of  a 
pyramid  by  a  plane  parallel  to  the  base  of  the  pyramid,  a  frustum  will  remain. 
By  analogy  one  might  expect  the  volume  of  a  frustum  of  a  pyramid  to  be 
given,  as  before,  by  the  product  of  the  altitude  and  the  arithmetic  average 
of  the  two  bases.  This  is  the  incorrect  Babylonian  formula  noted  in  the 
previous  section.  Reasoning  by  analogy  certainly  is  useful,  but  obviously  its 
conclusions  cannot  be  regarded  as  established. 
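
A small computation shows how far the analogy goes astray. The sketch below compares the formula suggested by analogy with the standard formula for the volume of a frustum of a square pyramid, h(a^2 + ab + b^2)/3, where a and b are the sides of the lower and upper bases; this standard formula is not stated in the text and is supplied here as an assumption of the sketch, as are the function names.

```python
def frustum_by_analogy(height, lower_side, upper_side):
    """Analogy with the trapezoid: altitude times the average of the two base areas."""
    return height * (lower_side ** 2 + upper_side ** 2) / 2

def frustum_exact(height, lower_side, upper_side):
    """Standard formula for a frustum of a square pyramid."""
    a, b = lower_side, upper_side
    return height * (a * a + a * b + b * b) / 3

# A frustum of height 6 with base sides 4 and 2.
print(frustum_by_analogy(6, 4, 2), frustum_exact(6, 4, 2))
```

The two formulas differ by h(a - b)^2/6, so the analogy overestimates the volume except when the two bases are equal and the solid is a prism.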

*Not to be confused with so-called mathematical induction, which is considered in
Section 7.3.

In sharp contrast to reasoning by analogy or by induction is reasoning
by deduction, because the conclusions reached by deduction, provided one
accepts the premises that are adopted and the system of logic that is employed,
are incontestable. To illustrate deductive procedure, consider the
following two statements: (1) All Canadians are Americans, (2) Two particular
men under consideration are Canadians. If we accept these two
statements,  we  are  logically  compelled,  following  accepted  principles  of 
Aristotelian  logic,  to  accept  a  third  statement,  namely,  (3)  The  men  under 
consideration  are  Americans.  This  is  an  example  of  deductive  reasoning, 
which  at  this  point  may  be  described  as  those  ways  of  deriving  new  statements 
from  accepted  ones  that  compel  us  to  accept  the  derived  statements.  In  the 
example, the first two statements are called premises, and the third statement
the  conclusion. 

It  is  very  important  to  realize  that  in  deductive  reasoning  we  are  not 
concerned  with  the  truth  of  the  conclusion  but  rather  whether  the  conclusion 
does  or  does  not  follow  from  the  premises.  If  the  conclusion  follows  from  the 
premises, we say that our reasoning is valid; if it does not, we say that our
reasoning  is  invalid.  For  example,  from  the  two 

premises:  (1)  All  college  students  are  clever, 

(2)  All  freshmen  are  college  students, 

follows  the 

conclusion:  All  freshmen  are  clever. 

Now  the  last  statement  certainly  is  not  regarded  generally  as  true,  but  the 
reasoning leading to it is valid. If both of the premises had been true, the conclusion
also would have been true; it is essential that one understand early in the treatment
of this book this meaning of the deductive process.

A useful and easy way to test the validity of a piece of deductive reasoning,
like either of the examples given above, is by a diagrammatic procedure
ascribed to the eminent Swiss mathematician Leonhard Euler (1707-1783).
Consider our last example. We may represent the class of all clever people
by a planar region within a closed boundary, and we may do likewise for the
class  of  all  college  students  and  for  the  class  of  all  freshmen.  But  statement  (1) 
insists that the class of all college students is contained in the class of all clever
people, and statement (2) insists that the class of all freshmen is contained in
the class of all college students. Thus our various classes must be represented
by their corresponding regions as shown in Figure 2. Clearly, the requirements
of our premises forced us to place the class of all freshmen entirely within
the class of all clever people, which is exactly what our conclusion asserts.
Hence, although the conclusion is undoubtedly false, the reasoning leading
to it is valid. It cannot be overemphasized at this point that the expert in the
use of deduction is not fundamentally concerned with truth but with validity;
he merely wants to be able to assert that his conclusions are implied by the
premises. It would then follow that if the premises should happen to be true,
the conclusion must, of necessity, also be true.

Consider,  as  a  second  example,  the  following: 

premises:  (1)  All  parallelograms  are  polygons. 

(2)  All  quadrilaterals  are  polygons. 

conclusion:  All  parallelograms  are  quadrilaterals. 

Here  all  three  statements  are  true,  but  the  reasoning  is  invalid,  for  the 
premises  do  not  force  us  to  place  the  region  representing  the  class  of  all 
parallelograms  entirely  within  the  region  representing  the  class  of  all 
quadrilaterals;  we  are  able  to  satisfy  the  requirements  of  our  premises  by  a 
diagram  like  that  shown  in  Figure  3. 

As  a  third  example,  consider  the  following: 

premises:  (1)  All  parallelograms  are  circles. 

(2)  All  circles  are  polygons. 

conclusion:  All  parallelograms  are  polygons. 

Here  the  premises  are  both  false,  the  conclusion  is  true,  and,  as  tested  by 
the diagram of Figure 4, the reasoning is valid. Thus false assumptions may
actually yield a true conclusion. True premises can yield only true conclusions
when deductive logic is applied, but false premises may or may not yield
true conclusions.

FIG. 3

FIG. 4

Finally,  we  shall  examine  the  following: 

premises:  (1)  No  quadrilaterals  are  triangles. 

(2) Some quadrilaterals are parallelograms.

conclusion: Some parallelograms are not triangles.

Since,  by  (1),  the  region  representing  the  class  of  all  quadrilaterals  and  that 
representing the class of all triangles cannot overlap, and, by (2), the region
representing the class of all quadrilaterals and that representing the class of
all parallelograms must overlap, the conclusion (see Figure 5) certainly
follows, and the reasoning is valid. Note, however, that we cannot conclude,
from our premises, that no parallelogram is a triangle, for there is nothing
that forces us to keep the region representing the class of all parallelograms
from cutting into the region representing the class of all triangles.

FIG. 5

Euler’s  diagrammatic  device  can  be  used  in  a  great  variety  of  situations, 
and  it  is  recommended  to  the  person  unfamiliar  with  logical  procedure. 
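
The diagrammatic test applied in the four examples above can also be carried out mechanically. What follows is only a minimal sketch in Python, under assumptions of our own: three classes divide the plane into eight possible regions, a diagram is recorded simply by listing which of those regions are occupied, and a statement such as "all X are Y" is read as saying that no occupied region lies inside X but outside Y (so it holds trivially when X is empty). The helper names all_are, no_are, some_are, some_are_not, and is_valid are illustrative and belong to no library. An argument is counted valid when every diagram satisfying the premises also satisfies the conclusion.

```python
from itertools import product

# A region is a triple of booleans: membership in class 0, class 1, class 2.
REGIONS = list(product([False, True], repeat=3))


def all_are(x, y):
    """'All x are y': every occupied region inside x is also inside y."""
    return lambda diagram: all(r[y] for r in diagram if r[x])


def no_are(x, y):
    """'No x are y': no occupied region lies inside both x and y."""
    return lambda diagram: not any(r[x] and r[y] for r in diagram)


def some_are(x, y):
    """'Some x are y': some occupied region lies inside both x and y."""
    return lambda diagram: any(r[x] and r[y] for r in diagram)


def some_are_not(x, y):
    """'Some x are not y': some occupied region lies inside x, outside y."""
    return lambda diagram: any(r[x] and not r[y] for r in diagram)


def diagrams():
    """Yield every choice of occupied regions (2**8 diagrams in all)."""
    for mask in product([False, True], repeat=len(REGIONS)):
        yield [r for r, occupied in zip(REGIONS, mask) if occupied]


def is_valid(premises, conclusion):
    """Valid means: no diagram satisfies the premises but not the conclusion."""
    return all(
        conclusion(d) for d in diagrams() if all(p(d) for p in premises)
    )


# Second example: parallelograms = 0, quadrilaterals = 1, polygons = 2.
example2 = is_valid([all_are(0, 2), all_are(1, 2)], all_are(0, 1))

# Third example: parallelograms = 0, circles = 1, polygons = 2.
example3 = is_valid([all_are(0, 1), all_are(1, 2)], all_are(0, 2))

# Fourth example: quadrilaterals = 0, triangles = 1, parallelograms = 2.
example4 = is_valid([no_are(0, 1), some_are(0, 2)], some_are_not(2, 1))

# The stronger conclusion "no parallelogram is a triangle" from the same premises.
example4_strong = is_valid([no_are(0, 1), some_are(0, 2)], no_are(2, 1))

print(example2, example3, example4, example4_strong)
```

Under these assumptions the second example is reported invalid and the third and fourth valid, in agreement with the diagrams of Figures 3, 4, and 5, while the stronger conclusion that no parallelogram is a triangle is rejected. Note that the program, like the diagrams, says nothing about whether any premise is true.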

We  shall  not,  for  the  present,  go  beyond  the  above  superficial  study  of 
inductive  and  deductive  reasoning.  As  already  indicated,  deductive  reasoning 
has  the  advantage  that  its  conclusions  are  unquestionable  if  the  premises 
are  accepted,  and  it  has  the  additional  advantage  of  considerable  economy: 
before  a  bridge  is  built  and  put  into  use,  deductive  reasoning  can  determine 
the  outcome.  But,  in  spite  of  these  particular  advantages,  deductive  reasoning 
does  not  supplant  the  inductive  approach;  actually,  each  way  of  obtaining 
knowledge  has  its  advantages  and  disadvantages.  The  significant  thing,  from 
the  point  of  view  of  our  present  study,  is  that  the  ancient  Greeks  found  in 
deductive reasoning the vital element of the modern mathematical method.

1.3 Early Greek Mathematics and the Introduction of Deductive Procedures

The  origin  of  early  Greek  mathematics  is  clouded  by  the  greatness  of 
Euclid’s Elements, written about 300 b.c., because this work so clearly excelled
many preceding Greek writings on mathematics that the earlier works were
thenceforth discarded. As a great mathematician* once remarked, one can
measure the importance of a scientific work by the number of earlier publications
rendered superfluous by it.

The debt of Greek mathematics to ancient oriental mathematics is
difficult to evaluate, nor has the path of transmission from the one to the
other  yet  been  satisfactorily  uncovered.  That  the  debt  is  considerably  greater 
than  formerly  believed  became  evident  with  twentieth-century  researches  of 
Babylonian  and  Egyptian  records.  Greek  writers  themselves  expressed  respect 
for  the  wisdom  of  the  East,  and  this  wisdom  was  available  to  anyone  who 
could  travel  to  Egypt  and  Babylonia.  There  are  also  internal  evidences  of  a 
connection  with  the  East.  Early  Greek  mysticism  in  mathematics  smacks 
strongly  of  oriental  influence,  and  some  Greek  writings,  like  those  of  Heron 
and  Diophantus,  exhibit  a  Hellenistic  perpetuation  of  the  more  arithmetic 
tradition  of  the  orient.  Also,  there  are  strong  links  connecting  Greek  and 
Mesopotamian  astronomy. 

But,  whatever  the  strength  of  the  historical  connection  between  Greek 
and  ancient  oriental  mathematics,  the  Greeks  transformed  the  subject  into 
something  vastly  different  from  the  set  of  empirical  conclusions  worked  out 
by their predecessors. The Greeks insisted that mathematical facts must be
established, not by empirical procedures, but by deductive reasoning; mathematical
conclusions must be assured by logical demonstration rather than
by laboratory experimentation.

This  is  not  to  say  that  the  Greeks  shunned  preliminary  empirical  and 
experimental  methods  in  mathematics,  for  it  is  probably  quite  true  that  few, 
if  any,  significant  mathematical  facts  have  ever  been  found  without  some 
preliminary  empirical  work  of  one  form  or  another.  Before  a  mathematical 
statement  can  be  proved  or  disproved  by  deduction,  it  must  first  be  thought 
of,  or  conjectured,  and  a  conjecture  is  nothing  but  a  guess  made  more  or  less 
plausible  by  intuition,  observation,  analogy,  experimentation,  or  some  other 
form  of  empirical  procedure.  Deduction  is  a  convincing  formal  mode  of 
exposition,  but  it  is  hardly  a  means  of  discovery.  It  is  a  set  of  complicated 
machinery  that  needs  material  to  work  upon,  and  the  material  is  frequently 
furnished  by  empirical  considerations.  Even  the  steps  of  a  deductive  proof 
or  disproof  are  not  dictated  to  us  by  the  deductive  apparatus  itself,  but  must 
be arrived at by trial and error, experience, and shrewd guessing. Indeed,
skill in the art of good guessing is one of the prime ingredients in the make-up
of a worthy mathematician. What is important here is that the Greeks
insisted that a conjectured or laboratory-obtained mathematical statement
must be followed up with a rigorous proof or disproof by deduction, and that
no amount of verification by experiment is sufficient to establish the statement.

*David Hilbert (1862-1943).

It  is  difficult  to  give  a  wholly  adequate  explanation  of  just  why  the 
Greeks of 600 to 400 b.c. decided to abandon empirical methods of establishing
mathematical knowledge and to insist that all mathematical conclusions
be established only by deductive reasoning. This completely new
viewpoint  on  mathematical  method  is  usually  explained  by  the  peculiar 
mental  bias  of  the  Greeks  of  classical  times  toward  philosophical  inquiries. 
In  philosophical  speculations,  reasoning  centers  about  abstract  concepts  and 
broad  generalizations,  and  is  concerned  with  inevitable  conclusions  following 
from assumed premises. Now the empirical method affords no way of discriminating
between a valid and an invalid argument, and so is hardly
applicable  to  philosophic  considerations.  It  is  deductive  reasoning  that 
philosophers  find  to  be  their  indispensable  tool,  and  so  the  Greeks  naturally 
gave  preference  to  this  method  when  they  began  to  consider  mathematics. 

Another  explanation  of  the  Greek  preference  for  deduction  stems  from 
the  Hellenic  love  for  beauty.  Appreciation  of  beauty  is  an  intellectual  as  well 
as  an  emotional  experience,  and  from  this  point  of  view  the  orderliness, 
consistency,  completeness,  and  conviction  found  in  deductive  argument  are 
very  satisfying. 

A still further explanation for the Greek preference for deductive procedures
has been found in the nature of Greek society in classical times.
Philosophers,  mathematicians,  and  artists  belonged  to  a  social  class  that  in 
general  disdained  manual  work  and  practical  pursuits,  which  were  carried 
on  by  a  large  slave  class.  In  Greek  society  the  slave  class  ran  the  businesses 
and  managed  the  industries,  took  care  of  households,  and  did  both  the 
technical  and  the  unskilled  work  of  the  time.  This  slave  basis  naturally 
fostered  a  separation  of  theory  from  practice  and  led  the  members  of  the 
privileged  class  to  a  preference  for  deduction  and  abstraction  and  a  disdain 
for  experimentation  and  practical  application. 

It  is  disappointing  that,  unlike  the  situation  with  ancient  Egyptian 
and  Babylonian  mathematics,  there  exist  virtually  no  source  materials  for 
contemporary  study  that  throw  much  light  upon  early  Greek  mathematics. 
We  are  forced  to  rely  upon  manuscripts  and  accounts  that  are  dated  several 
hundred  years  after  the  original  treatments  were  written.  In  spite  of  this 
difficulty,  however,  scholars  of  classicism  have  been  able  to  build  up  a  rather 
consistent,  though  somewhat  hypothetical,  account  of  the  history  of  early 
Greek  mathematics,  and  have  even  plausibly  restored  many  of  the  original 
Greek  texts.  This  work  required  amazing  ingenuity  and  patience;  it  was 
carried through by painstaking comparisons of derived texts and by the
examination of countless literary fragments and scattered remarks made by
later  authors,  philosophers,  and  commentators. 

Our  principal  source  of  information  concerning  very  early  Greek 
mathematics  is  the  so-called  Eudemian  Summary  of  Proclus.  This  summary 
constitutes a few pages of Proclus’ Commentary on Euclid, Book I, and is a
very  brief  outline  of  the  development  of  Greek  geometry  from  the  earliest 
times  to  Euclid.  Although  Proclus  lived  in  the  fifth  century  a.d.,  a  good 
thousand  years  after  the  inception  of  Greek  mathematics,  he  still  had  access 
to  a  number  of  historical  and  critical  works  which  are  now  lost  to  us  except 
for  the  fragments  and  allusions  preserved  by  him  and  others.  Among  these 
lost  works  was  apparently  a  full  history  of  Greek  geometry,  covering  the 
period  before  335  b.c.,  written  by  Eudemus,  a  pupil  of  Aristotle.  The 
Eudemian  Summary  is  so  named  because  it  is  based  upon  this  earlier  work. 

According  to  the  Eudemian  Summary,  Greek  mathematics  appears  to  have 
started  in  an  essential  way  with  the  work  of  Thales  of  Miletus  in  the  first 
half of the sixth century b.c. This versatile genius, declared to be one of the
“seven wise men” of antiquity, was a worthy founder of systematic mathematics
and is the first known individual with whom the use of deductive
methods  in  mathematics  is  associated.  Thales,  the  summary  tells  us,  sojourned 
for  a  time  in  Egypt  and  brought  back  geometry  with  him  to  Greece,  where 
he  began  to  apply  to  the  subject  the  deductive  procedures  of  philosophy.  In 
particular,  he  is  credited  with  the  following  elementary  geometrical  results: 

1.  A  circle  is  bisected  by  any  diameter. 

2.  The  base  angles  of  an  isosceles  triangle  are  equal. 

3.  Vertical  angles  formed  by  two  intersecting  straight  lines  are  equal. 

4.  Two  triangles  are  congruent  if  two  angles  and  a  side  in  one  are  equal 
respectively  to  two  angles  and  the  corresponding  side  of  the  other.  (It 
is  thought  that  Thales  used  this  result  to  determine  the  distance  of  a 
ship  from  shore.) 

5.  An  angle  inscribed  in  a  semicircle  is  a  right  angle.  (The  Babylonians  of 
some  1400  years  earlier  were  acquainted  with  this  geometrical  fact.) 

We  are  not  to  measure  the  value  of  these  results  by  their  content  but  rather 
by  the  belief  that  Thales  supported  them  with  a  certain  amount  of  logical 
reasoning  instead  of  intuition  and  experiment.  For  the  first  time  a  student 
of  mathematics  was  committed  to  a  form  of  deductive  reasoning,  crude  and 
incomplete  though  it  may  have  been.  Moreover,  the  fact  that  the  first 
deductive  thinking  was  done  in  the  field  of  geometry  instead  of  algebra,  for 
instance,  inaugurated  a  tradition  in  mathematics  which  was  maintained,  as 
we  shall  see,  until  very  recent  times. 

The  next  outstanding  Greek  mathematician  mentioned  in  the  Eudemian 
Summary  is  Pythagoras,  who  is  claimed  to  have  continued  the  purification 
of geometry that was begun some fifty years earlier by Thales. Pythagoras
was born about 572 b.c. on the island of Samos, one of the Aegean islands
near  Thales’  home  city  of  Miletus,  and  it  is  quite  possible  that  he  studied 
under  the  older  man.  It  seems  that  Pythagoras  then  visited  Egypt  and  perhaps 
traveled  even  more  extensively  about  the  orient.  When,  on  returning  home, 
he  found  Ionia  under  Persian  dominion,  he  decided  to  migrate  to  the  Greek 
seaport  of  Crotona  in  southern  Italy.  Here  he  founded  the  celebrated 
Pythagorean  school,  a  brotherhood  knit  together  with  secret  and  cabalistic 
rites and observances, and committed to the study of philosophy, mathematics,
and natural science.

The  philosophy  of  the  Pythagorean  school  was  built  upon  the  mystical 
assumption  that  whole  number  is  the  cause  of  the  various  qualities  of  man 
and  matter.  This  oriental  outlook,  perhaps  acquired  by  Pythagoras  in  his 
eastern  travels,  led  to  the  exaltation  and  study  of  number  relations  and  to  a 
perpetuation  of  numerological  nonsense  that  has  lasted  even  into  modern 
times.  However,  in  spite  of  the  unscientific  nature  of  much  of  Pythagorean 
study,  members  of  the  society  contributed,  during  the  two  hundred  or  so 
years  following  the  founding  of  their  organization,  a  good  deal  of  sound 
mathematics.  They  developed  the  properties  of  parallel  lines  and  used  them 
to  prove  that  the  sum  of  the  angles  of  any  triangle  is  equal  to  two  right  angles. 
They  contributed  in  a  noteworthy  manner  to  Greek  geometrical  algebra; 
they effected the geometrical equivalent of addition, subtraction, multiplication,
division, extraction of roots, and even the complete solution of the
general quadratic equation insofar as it has real roots. They developed a
fairly complete theory of proportion, though it was limited only to commensurable
magnitudes, and used it to deduce properties of similar figures.
They  were  aware  of  the  existence  of  at  least  three  of  the  regular  polyhedral 
solids,  and  they  discovered  the  incommensurability  of  a  side  and  a  diagonal 
of  a  square.  Although  much  of  this  information  was  already  known  to  the 
Babylonians  of  earlier  times,  the  deductive  aspect  of  mathematics  is  thought 
to  have  been  considerably  exploited  and  advanced  in  this  work  of  the 
Pythagoreans.  Chains  of  propositions  in  which  successive  propositions  were 
derived  from  earlier  ones  in  the  chain  began  to  emerge.  As  the  chains 
lengthened,  and  some  were  tied  to  others,  the  bold  idea  of  developing  all  of 
geometry  in  one  long  chain  suggested  itself.  It  is  claimed  in  the  Eudemian 
Summary that the Pythagorean, Hippocrates of Chios,* was the first to
attempt,  with  at  least  partial  success,  a  logical  presentation  of  geometry  in 
the  form  of  a  single  chain  of  propositions  based  upon  a  few  initial  definitions 
and  assumptions. 

*Not to be confused with Hippocrates of Cos, the eminent Greek physician of antiquity.

The famous Greek philosopher, Plato, was strongly influenced by the
Pythagoreans, and Plato, in turn, exerted a considerable influence on the
development of mathematics in Greece. Plato’s influence was not due to
any mathematical discoveries he made but rather to his enthusiastic conviction
that the study of mathematics furnished the finest training field for
the  mind,  and  hence  was  essential  in  the  cultivation  of  philosophers  and  those 
who  should  govern  his  ideal  state.  This  belief  explains  the  renowned  motto 
over  the  door  of  his  Academy,  “Let  no  one  unversed  in  geometry  enter  here.” 
Thus,  because  of  its  logical  element  and  the  pure  attitude  of  mind  that  he 
felt  its  study  creates,  mathematics  seemed  of  utmost  importance  to  Plato, 
and  for  this  reason  it  occupied  a  valued  place  in  the  curriculum  of  the 
Academy.  Some  see  in  certain  of  Plato’s  dialogues  what  may  perhaps  be 
considered  the  first  serious  attempt  at  a  philosophy  of  mathematics.  Certainly 
mathematics  in  Greece  at  the  time  of  Plato  had  advanced  a  long  way  from 
the  empirical  mathematics  of  ancient  Egypt  and  Babylonia. 


1.4  Material  Axiomatics 

Much  was  accomplished  by  the  Greeks  during  the  three  hundred  years 
between Thales in 600 b.c. and Euclid in 300 b.c. Not only did the Pythagoreans
and others develop the material that ultimately was organized into
the Elements of Euclid, but there were developed notions concerning infinitesimals
and summation processes (notions which did not attain final
clarification until the rigorization of the calculus in modern times) and also
considerable higher geometry (the geometry of curves other than the circle
and the straight line and of surfaces other than the sphere and plane).
Curiously enough, much of this higher geometry originated in continued
attempts to solve the three famous construction problems of antiquity (the
duplication of a cube, the trisection of an arbitrary angle, and the quadrature
of a circle), illustrating the principle that the growth of mathematics is
stimulated by the presence of outstanding unsolved problems.

Also, some time during the first three hundred years of Greek mathematics,
there developed the Greek notion of a logical discourse as a sequence
of  statements  obtained  by  deductive  reasoning  from  an  accepted  set  of  initial 
statements.  Certainly,  if  one  is  going  to  present  an  argument  by  deductive 
procedure,  any  statement  of  the  argument  will  have  to  be  derived  from  some 
previous  statement  or  statements  of  the  argument,  and  such  a  previous 
statement  must  itself  be  derived  from  some  still  more  previous  statement  or 
statements.  Clearly  this  cannot  be  continued  backward  indefinitely,  nor 
should one resort to illogical circularity by deriving statement q from statement
p, and then later deriving statement p from statement q. The only way
out of the difficulty is to set down, toward the start of the discourse, a collection
of fundamental statements whose truths are to be accepted, and then
to  proceed  by  purely  deductive  reasoning  to  derive  all  the  other  statements 
of the discourse. Now both the initial and the derived statements of the
discourse are statements about the technical matter of the discourse, and
hence  involve  special  or  technical  terms.  The  meanings  of  these  terms  must 
be  made  clear  to  the  reader,  and  so,  the  Greeks  felt,  the  discourse  should 
start  with  a  list  of  explanations  and  definitions  of  these  technical  terms.  After 
these  explanations  and  definitions  have  been  given,  the  initial  statements, 
called  axioms  and/or  postulates  of  the  discourse,  are  to  be  listed.  These  initial 
statements, according to the viewpoint held by some of the Greeks, should
be  so  carefully  chosen  that  their  truths  are  quite  acceptable  to  the  reader  in 
view  of  the  explanations  and  definitions  already  cited. 

A  discourse  which  is  conducted  according  to  the  above  plan  is  described 
today as a development by material axiomatics. Certainly the most outstanding
contribution  of  the  early  Greeks  to  mathematics  was  the  formulation  of 
axiomatic  procedure  and  the  insistence  that  mathematics  be  systematized 
by  such  a  procedure.  Euclid’s  Elements  is  the  earliest  extensively  developed 
example  of  axiomatic  procedure  that  has  come  down  to  us;  it  largely  follows 
the  pattern  of  material  axiomatics,  and  we  shall  certainly  want  to  examine 
it  in  some  detail.  In  more  recent  years,  the  pattern  of  material  axiomatics 
has  been  significantly  generalized  to  yield  a  more  abstract  form  of  discourse 
known  as  formal  axiomatics  (see  Chapter  6).  For  the  time  being  we  will  content 
ourselves  by  summarizing  the  pattern  of  material  axiomatics. 

Pattern  of  Material  Axiomatics 

1.  Initial  explanations  of  certain  basic  technical  terms  of  the  discourse  are 
given,  with  the  intention  of  suggesting  to  the  reader  what  these  basic 
terms  are  to  mean  in  the  discourse. 

2.  Certain  primary  statements  are  listed  concerning  the  basic  terms,  which 
are  felt  to  be  acceptable  as  true  on  the  basis  of  the  properties  suggested 
by  the  initial  explanations.  These  primary  statements  are  called  the 
axioms  and/or  postulates  of  the  discourse. 

3.  All  subsequent  technical  terms  of  the  discourse  are  defined  by  means 
of  previous  ones. 

4.  All  subsequent  statements  about  the  technical  terms  of  the  discourse 
are  logically  deduced  from  previous  statements.  We  shall  call  these 
derived  statements  the  theorems  of  the  discourse. 

To  gain  a  feeling  for  the  pattern  of  material  axiomatics,  let  us  consider 
an example. Suppose one is faced with the task of developing a logical discourse
on carpentry. The subject of carpentry contains many special or
technical  terms,  such  as  nail,  spike,  brad,  screw,  wood,  hard  wood,  soft 
wood,  board,  strut,  beam,  hammer,  saw,  screw  driver,  plane,  or  chisel.  Some 
of  these  technical  terms  can  be  defined  in  terms  of  others.  For  example,  a 
spike  and  a  brad  can  each  be  defined  as  a  special  kind  of  nail;  hard  wood  and 
soft  wood  can  be  defined  as  certain  special  kinds  of  wood;  board,  strut,  and 
beam can be defined as pieces of wood of certain shapes used for certain
purposes; various kinds of hammers and saws can be defined in terms of the
basic hammer and saw. It is certainly logical, then, to commence the discourse
with some sort of explanation or description of the basic technical
terms,  say  nail,  wood,  hammer,  saw,  and  others,  and  then  to  define  further 
technical  terms,  either  at  the  start  or  as  needed,  in  terms  of  the  basic  ones. 
After  giving  these  initial  explanations  and  possible  definitions,  the  next  thing 
to  do  is  to  list  some  fundamental  statements  about  the  explained  and  defined 
terms  that  will  be  assumed  so  that  the  discourse  may  get  under  way.  Now 
these  assumed  statements,  from  the  point  of  view  of  material  axiomatics, 
should  be  such  that  the  reader  is  perfectly  willing  to  accept  them  on  account 
of  the  initial  explanations  of  the  basic  terms  involved.  For  example,  one  may 
wish to assume that it is always possible to drive a nail with a hammer into a piece
of wood, that it is always possible with a saw to cut a piece of wood in two by a
planar cut, etc. That two boards of desired lengths can be fastened together
with  nails  now  follows  as  a  consequence  of  these  assumptions,  and  is  thus  a 
theorem  of  the  discourse.  Probably  enough  has  been  said  to  illustrate  the 
Greek  notion  of  material  axiomatics. 

The  theory  of  some  simple  games  can  be  rather  easily  developed  by 
material  axiomatics.  Consider,  for  example,  the  familiar  game  of  tic-tac-toe. 
Among  the  technical  terms  of  this  game  are  the  playing  board,  nought,  cross, 
a  win,  a  draw,  etc.  These  technical  terms  are  to  be  explained  or  defined. 
The  rules  of  the  game  are  then  stated  as  the  postulates  of  the  discourse,  these 
rules  being  perfectly  acceptable  once  one  understands  the  basic  terms  of  the 
discourse.  From  these  rules  one  can  then  proceed  to  deduce  the  theory  of 
the  game,  proving  as  a  theorem,  for  example,  that  with  sufficiently  good  playing, 
the  player  who  starts  a  game  need  not  lose  the  game. 
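
The theorem just mentioned can be checked by exhausting every possible game. The Python sketch below is one way of doing so; it is a brute-force verification rather than a deduction from stated postulates in the Greek manner. It assumes the usual rules (a board of nine cells, alternate moves, three in a row wins) and, as a further assumption of ours, that cross moves first. The names LINES, winner, and value are our own.

```python
from functools import lru_cache

# The eight winning lines of the 3-by-3 board, cells numbered 0..8 row by row.
LINES = [
    (0, 1, 2), (3, 4, 5), (6, 7, 8),  # rows
    (0, 3, 6), (1, 4, 7), (2, 5, 8),  # columns
    (0, 4, 8), (2, 4, 6),             # diagonals
]


def winner(board):
    """Return 'X' or 'O' if that mark fills a line, otherwise None."""
    for a, b, c in LINES:
        if board[a] != " " and board[a] == board[b] == board[c]:
            return board[a]
    return None


@lru_cache(maxsize=None)
def value(board, to_move):
    """Outcome for cross ('X') under best play by both sides.

    +1 means cross wins, 0 means a draw, -1 means cross loses.
    """
    w = winner(board)
    if w == "X":
        return 1
    if w == "O":
        return -1
    if " " not in board:
        return 0  # board full, nobody has a line: a draw
    other = "O" if to_move == "X" else "X"
    outcomes = [
        value(board[:i] + to_move + board[i + 1:], other)
        for i, cell in enumerate(board)
        if cell == " "
    ]
    # Cross picks the best outcome for cross; nought picks the worst.
    return max(outcomes) if to_move == "X" else min(outcomes)


EMPTY = " " * 9
first_player_value = value(EMPTY, "X")
print(first_player_value)  # prints 0
```

A value of 0 for the empty board means that the starting player can always secure at least a draw, which is the content of the theorem; it also shows that the starting player cannot force a win against sufficiently good play.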

1.5 The Origin of the Axiomatic Method

We  do  not  know  with  whom  the  axiomatic  method  originated.  By  the 
account given in the Eudemian Summary, the method seems to have evolved
with  the  Pythagoreans  as  a  natural  outgrowth  and  refinement  of  the  early 
application  of  deductive  procedures  to  mathematics.  This  is  the  traditional 
and  customary  account  and  is  based  principally  upon  Proclus’  summary, 
which,  in  turn,  is  based  upon  the  lost  history  of  geometry  written  by  Eudemus 
about  335  b.c.  The  account  may  be  the  true  one,  and,  if  so,  we  must  concede 
to  the  Pythagoreans  a  very  high  place  in  the  history  of  the  development  of 
mathematics. 

There  are  some  historians  of  ancient  mathematics  who  find  the  account 
of  the  early  history  of  Greek  mathematics,  as  reconstructed  from  the  Eudemian 
Summary, somewhat difficult to believe, and who feel that the traditional
stories about Thales and Pythagoras must be discredited as purely legendary
and unhistorical in content. For example, the Eudemian Summary says that
Thales  proved  that  a  circle  is  bisected  by  any  one  of  its  diameters.  The 
realization  that  so  obvious  a  matter  as  this  should  need  demonstration  seems 
to  reflect  a  mathematical  sophistication  of  a  much  more  advanced  period, 
when  the  importance  and  delicacy  of  initial  assumptions  had  become  much 
clearer.  Eudemus  may  have  hypothetically  restored  the  sequence  of  events 
so that they accorded with the state of the theory of his time, as many historians
do when source material is not available. Actually, we can have very
little  idea  of  the  roles  played  in  the  history  of  mathematics  by  Thales  and 
Pythagoras,  and  it  may  be  much  closer  to  reality  to  assume  that  early  Greek 
mathematics  cannot  have  differed  greatly  from  the  oriental  type.  An  essential 
turn  in  the  development  of  a  subject  is  usually  brought  about  by  some  crucial 
circumstance, and in mathematics such a circumstance arose some time in
the fifth century b.c. with the devastating discovery of the irrationality of $\sqrt{2}$.

Let  us  pause  a  moment  to  consider  the  significance  of  the  last  statement. 
Since the rational numbers consist of all numbers of the form $p/q$, where $p$
and $q$ are integers with $q \neq 0$, the discovery alluded to states that there are no
integers $p$ and $q$ such that $p/q = \sqrt{2}$; that is, $\sqrt{2}$ is not a rational number
and hence, by definition, is an irrational number. The traditional proof of
this fact, apparently known to Aristotle (384-322 b.c.), is simple and runs as
follows: Suppose, on the contrary, that there are two integers $p$ and $q$ such
that $p/q = \sqrt{2}$, where, without any loss of generality, we may assume that $p$
and $q$ have no common positive integral factor other than unity. Then
$p^2 = 2q^2$. Since $p^2$ is twice an integer, we see that $p^2$, and hence $p$, must be
even. So we may put $p = 2r$. Then we find $4r^2 = 2q^2$, or $2r^2 = q^2$, from which
we conclude that $q^2$, and hence $q$, must be even. But this is impossible, since
we assumed that $p$ and $q$ have no common integral factor different from unity.
The supposition that $\sqrt{2}$ is rational has led to a contradictory situation,
whence it follows that $\sqrt{2}$ must be irrational. This result was surprising and
disturbing on several grounds. First of all, it seemed to deal a mortal blow to
the Pythagorean philosophy that all depends upon the integers. Next, it
seemed contrary to common sense, for it was felt intuitively that any magnitude
could be expressed by some rational number. The geometrical counterpart
was equally startling, for who could doubt that for any two given line segments
one is able to find some third line segment, perhaps very very small,
that can be marked off a whole number of times into each of the given
segments. But take as the two given segments a side $s$ and a diagonal $d$ of a
square. Now if there exists a third segment $t$ which can be marked off a whole
number of times into $s$ and $d$ we would have $s = qt$ and $d = pt$, where $p$ and $q$
are integers. But $d = s\sqrt{2}$, whence $pt = qt\sqrt{2}$, or $\sqrt{2} = p/q$, a rational
number. Contrary to intuition, then, there exist line segments having no
common unit of measure. But the whole Pythagorean theory of proportion
was built upon the seemingly obvious assumption that any two line segments
are  commensurable,  that  is,  do  have  some  common  unit  of  measure. 
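
A finite search cannot replace the proof above, and the Greek insistence on deduction is precisely the point; still, a short experiment shows what the theorem predicts. The Python sketch below looks for integers $p$ and $q$ with $p^2 = 2q^2$ among all $q$ up to a bound. The function name and the bound of 100,000 are arbitrary choices of ours, made only for illustration.

```python
from math import isqrt


def rational_candidates_for_sqrt2(max_q):
    """Return every pair (p, q) with 1 <= q <= max_q and p*p == 2*q*q."""
    hits = []
    for q in range(1, max_q + 1):
        # isqrt gives the integer part of the square root of 2*q*q,
        # which is the only integer p that could satisfy p*p == 2*q*q.
        p = isqrt(2 * q * q)
        if p * p == 2 * q * q:
            hits.append((p, q))
    return hits


found = rational_candidates_for_sqrt2(100_000)
print(found)  # prints []
```

The empty result agrees with the theorem but establishes it only for the denominators actually tried; the deductive argument is what covers all integers at once.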

No wonder the discovery of the irrationality of $\sqrt{2}$ led to some consternation
in the Pythagorean ranks. The situation must have caused a
profound reaction in mathematical thinking, and must have very considerably
emphasized the extreme importance of careful agreement on what can
be  taken  for  basic  assumptions.  A  crisis,  like  this  one  of  the  discovery  of 
irrational  numbers,  could  well  account  for  the  origin  of  the  axiomatic 
method,  and,  if  so,  the  credit  for  the  invention  might  largely  go  to  Eudoxus, 
the  genius  of  the  time  who  finally  resolved  the  crisis  that  had  arisen.* 

This  second  explanation  of  the  possible  origin  of  the  axiomatic  method 
has  other  points  in  its  favor.  For  example,  it  places  less  stress  on  any  peculiar 
mentality  possessed  by  the  Greeks  of  very  early  times,  and  it  accounts  for 
the  relatively  large  number  of  Greek  papyrus  fragments  containing  texts 
after  the  pattern  of  oriental  mathematics.  These  texts,  like  the  similar  ones 
from  Babylonian  times,  probably  formed  the  backbone  of  instruction  in 
elementary  mathematics.  At  this  elementary  level  the  highly  sophisticated 
axiomatic  method  had  as  little  influence  as  it  has  today  in  much  of  our 
elementary teaching. Writings of this sort, then, do not reflect any degeneration
of the so-called Greek spirit in mathematics but simply exhibit the
continuance,  on  an  elementary  level,  of  older  traditions.  Heron’s  geometry, 
for  example,  can  be  properly  considered  a  Hellenistic  form  of  oriental 
tradition;  it  should  not  be  regarded  as  a  sign  of  decline  in  Greek  mathematics 
just  because  it  does  not  employ  the  refined  procedures  of  the  axiomatic 
method. 

Perhaps  it  is  needless  to  hypothesize  about  the  origin  of  the  axiomatic 
method. Certainly, by the middle of the fourth century b.c., the method had
been fairly well developed, for in Aristotle’s Analytica posteriora, we find a
good  deal  of  light  thrown  upon  some  of  its  features.  Aristotle  was  not  a 
mathematician,  but,  as  the  systematizer  of  classical  logic,  he  found  in 
elementary  mathematics  excellent  models  of  logical  reasoning,  and  his 
mathematical  illustrations  tell  us  a  great  deal  about  the  principles  of  the 
axiomatic  method  as  accepted  in  his  time.  By  the  turn  of  the  century  the 
stage  was  set  for  Euclid’s  magnificent  and  epoch-making  application  of  the 
axiomatic  method. 

PROBLEMS

1.1.1 In the Rhind papyrus the area of a circle is taken as equal to that
of a square on 8/9 of the circle’s diameter. Show that this is equivalent to taking
$\pi = 3.1604\ldots$.
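
A short numerical check may help here. The sketch below is ours and not part of the original text; the function name `rhind_pi` is hypothetical. It equates the Egyptian area (8d/9)^2 with the modern area pi*d^2/4 and solves for pi in exact fractions.

```python
from fractions import Fraction


def rhind_pi() -> Fraction:
    """Value of pi implied by taking a circle's area as the square on 8/9 of its diameter."""
    # (8d/9)^2 = pi * d^2 / 4, so pi = 4 * (8/9)^2; the diameter d cancels.
    return 4 * Fraction(8, 9) ** 2


implied = rhind_pi()
print(implied, float(implied))
```

The exact fraction makes it easy to compare the implied value with the decimal quoted in the problem.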


*See Appendix, Section A.6.

1.1.2 The Sulvasutras, ancient Hindu religious writings dating from about
500  b.c.,  are  of  interest  in  the  history  of  mathematics  because  they  embody 
certain geometrical rules for the construction of altars and show an acquaintance
with the Pythagorean theorem. Among the rules furnished there appear
empirical solutions of the circle-squaring problem which are equivalent to
taking $d = (2 + \sqrt{2})s/3$ and $s = 13d/15$, where d is the diameter of the
circle and s is the side of the equal square. These formulas are equivalent
to taking what values for $\pi$?

1.1.3  Show  that  the  ancient  Babylonian  formula  K  =  (a  +  c)(b  +  d)/4, 
for the area of a quadrilateral having a, b, c, d for consecutive sides, gives too
large  an  answer  for  all  nonrectangular  quadrilaterals. 

1.1.4 (for students who have studied calculus). Prove, by elementary differential
calculus,  the  farmer’s  conjecture  that  the  rectangular  field  of  maximum  area 
lying  along  a  straight  river  bank  and  utilizing  a  given  amount  of  fencing 
has  a  length  that  is  twice  the  depth  of  the  field. 

1.1.5  A  disc  of  radius  R  spins  vertically  on  a  horizontal  axis  held  above 
the  surface  of  a  liquid.  As  the  disc  spins  it  cuts  into  the  liquid.  Estimate,  by 
empirical  methods,  how  high  the  axis  must  be  above  the  liquid’s  surface  so 
that  the  wetted  area  of  the  spinning  disc  above  the  surface  of  the  liquid  shall 
be  a  maximum.  (This  problem  arose  in  the  manufacture  of  fruit  syrups  from 
fruit  juices,  and  was  solved  empirically  by  the  manufacturer,  who  found  the 
required  height  r  of  the  axis  above  the  surface  of  the  liquid  to  be  about 
(3/10)R. It is not very difficult to show by differential calculus that
$r = R/(1 + \pi^2)^{1/2}$. It is interesting that the General Electric Company began
studies  of  this  evaporation  method  in  the  early  1960’s  in  connection  with 
the  design  of  a  diffusion  still.) 

1.1.6  Two  ladders,  60  ft  long  and  40  ft  long,  lean  from  opposite  sides 
across  an  alley  lying  between  two  buildings,  the  feet  of  the  ladders  resting 
against  the  bases  of  the  buildings.  If  the  ladders  cross  each  other  at  a  height 
of  10  ft  above  the  alley,  how  wide  is  the  alley?  Solve  this  problem  empirically 
from  drawings.  (An  algebraic  treatment  of  this  problem  requires  the  solution 
of  a  quartic  equation.  If  a  and  b  represent  the  lengths  of  the  ladders,  c  the 
height  at  which  they  cross,  and  x  the  width  of  the  alley,  it  can  be  shown 
that $(a^2 - x^2)^{-1/2} + (b^2 - x^2)^{-1/2} = c^{-1}$.)
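
As a complement to the drawing, the relation 1/sqrt(a^2 - x^2) + 1/sqrt(b^2 - x^2) = 1/c can also be solved numerically. The following is a minimal sketch using bisection; the function name `alley_width` and the iteration count are our own choices, not something given in the text.

```python
import math


def alley_width(a: float, b: float, c: float, iterations: int = 200) -> float:
    """Solve 1/sqrt(a^2 - x^2) + 1/sqrt(b^2 - x^2) = 1/c for the width x by bisection."""

    def residual(x: float) -> float:
        return 1 / math.sqrt(a * a - x * x) + 1 / math.sqrt(b * b - x * x) - 1 / c

    # The left side grows with x, so the residual changes sign exactly once
    # between x = 0 and x just below the shorter ladder's length.
    lo, hi = 0.0, min(a, b) * (1 - 1e-12)
    if residual(lo) >= 0:
        raise ValueError("no positive width: the crossing height c is too large")
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if residual(mid) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


width = alley_width(60, 40, 10)
print(round(width, 3))
```

Bisection is used here only because it needs nothing beyond the sign of the residual; any root finder would serve.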

1.1.7  How  good  is  the  following  empirical  straightedge  and  compass 
trisection  of  an  angle  of  30°?  Let  AOB  be  the  given  angle,  with  OA  =  OB. 
On  AB  as  diameter  draw  a  semicircle  lying  on  the  same  side  of  AB  as  is  the 
point O. Take D and E on the semicircle such that AD = DE = EB. Take
F on DE such that DF = DE/4. Then OF is the sought trisector.

1.1.8  Solve,  by  the  rule  of  false  position,  the  following  problem  found  in 
the Rhind papyrus: “A quantity, its 2/3, its 1/2, and its 1/7, added together,
become 33. What is the quantity?”
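
The rule of false position can be sketched in a few lines. The code below is an illustration under our own assumptions: the function name `false_position` and the trial guess of 42 (chosen because it clears the denominators 3, 2, and 7) are not from the text. Because the left side is proportional to the unknown, a wrong guess is simply rescaled by the ratio of the desired total to the total the guess produces.

```python
from fractions import Fraction


def false_position(coefficients, target, guess):
    """Solve (sum of coefficients) * x = target by scaling a trial value."""
    trial_total = sum(Fraction(c) * guess for c in coefficients)
    # The true quantity stands to the guess as the target stands to the trial total.
    return Fraction(guess) * Fraction(target) / trial_total


# A quantity, its 2/3, its 1/2, and its 1/7 together make 33.
coefficients = [1, Fraction(2, 3), Fraction(1, 2), Fraction(1, 7)]
quantity = false_position(coefficients, 33, 42)
print(quantity)
```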

1.1.9 Find the length of side BC in the quadrilateral pictured in Figure 6.

1.1.10 In the study of geometrical constructions there is a counterpart of
the  rule  of  false  position,  generally  known  as  the  method  of  similitude.  The 
method  lies  in  constructing  a  figure  similar  to  the  one  desired,  and  then, 
by  the  use  of  proportion,  “blowing  it  up”  to  proper  size.  Suppose,  for 
example,  we  wish  to  inscribe  a  square  in  a  given  triangle  ABC  so  that  one 
side of the square lies along the base BC of the triangle (see Figure 7). First
draw a square D'E'F'G' of any convenient size, as indicated. If F' falls on
AC, the problem is solved. Otherwise we have solved the problem for a
triangle A'BC' similar to triangle ABC and having B as a center of similitude.
It  follows  that  line  BF'  cuts  AC  in  the  vertex  F  of  the  sought  square  inscribed 
in  triangle  ABC. 

Construct,  by  the  method  of  similitude,  a  line  segment  DE,  where  D 
is  on  side  AB  and  E  on  side  AC  of  a  given  triangle  ABC,  so  that  BD  =  DE  = 
EC. 

1.1.11 In the Rhind papyrus we find, “If you are asked, what is 2/3 of 1/5,
take  the  double  and  the  sixfold;  that  is  2/3  of  it.  One  must  proceed  likewise 
for  any  other  fraction.”  Interpret  this  and  prove  the  general  statement. 

1.1.12  In  the  Moscow  papyrus  we  find  the  following  numerical  example: 
“If  you  are  told:  A  truncated  pyramid  of  6  for  the  vertical  height  by  4  on 
the base by 2 on the top. You are to square this 4, result 16. You are to
double 4, result 8. You are to square 2, result 4. You are to add the 16,
the  8,  and  the  4,  result  28.  You  are  to  take  one  third  of  6,  result  2.  You  are 
to  take  28  twice,  result  56.  See,  it  is  56.  You  will  find  it  right.”  Show  that 
this  illustrates  the  general  formula 

$$\frac{h(a^2 + ab + b^2)}{3}$$

giving  the  volume  of  a  frustum  of  a  square  pyramid  in  terms  of  the  height  h 
and  the  sides  a  and  b  of  the  bases. 
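
To see the correspondence concretely, the scribe's steps can be set beside the general formula h(a^2 + ab + b^2)/3. This is a sketch of ours; the function names `moscow_steps` and `frustum_volume` are hypothetical labels, and the reading of "double 4" as the product a*b = 4*2 is the interpretation the problem asks the reader to justify.

```python
from fractions import Fraction


def frustum_volume(h, a, b):
    """Volume of a frustum of a square pyramid: h(a^2 + ab + b^2)/3."""
    return Fraction(h) * (a * a + a * b + b * b) / 3


def moscow_steps(h, a, b):
    """Follow the order of operations described in the Moscow papyrus example."""
    square_base = a * a          # "square this 4, result 16"
    base_times_top = a * b       # "double 4, result 8" (4 taken b = 2 times)
    square_top = b * b           # "square 2, result 4"
    total = square_base + base_times_top + square_top  # "result 28"
    third_of_height = Fraction(h, 3)                   # "one third of 6, result 2"
    return third_of_height * total                     # "28 twice, result 56"


print(moscow_steps(6, 4, 2), frustum_volume(6, 4, 2))
```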

1.1.13  Interpret  the  following,  found  on  a  Babylonian  tablet  dating  from 
about  2600  b.c.:  “60  is  the  circumference,  2  is  the  perpendicular,  find  the 
chord.” “Thou, double 2 and get 4, dost thou not see? Take 4 from 20,
thou  gettest  16.  Square  20,  thou  gettest  400.  Square  16,  thou  gettest  256. 
Take  256  from  400,  thou  gettest  144.  Find  the  square  root  of  144.  12,  the 
square  root,  is  the  chord.  Such  is  the  procedure.” 

1.1.14  (a)  A  Babylonian  tablet  has  been  discovered  which  gives  the  values 
of $n^3 + n^2$ for n = 1 to 30. Make such a table for n = 1 to n = 10, and use it
to find a root of the cubic equation $x^3 + 2x^2 - 3136 = 0$.

(b) A Babylonian problem of about 1800 b.c. seems to call for the solution
of the simultaneous system $xyz + xy = 7/6$, $y = 2x/3$, $z = 12x$. Solve
this system using the table of part (a).
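
One way to use such a table is sketched below. The substitutions x = 2n in part (a) and n = 12x in part (b) are our own reductions, offered as one possible route rather than as the Babylonian procedure; the names `TABLE`, `solve_cubic_a`, and `solve_system_b` are hypothetical.

```python
from fractions import Fraction

# Table of n^3 + n^2 for n = 1 to 30, indexed by value for reverse lookup.
TABLE = {n**3 + n**2: n for n in range(1, 31)}


def solve_cubic_a() -> int:
    """Root of x^3 + 2x^2 = 3136 via x = 2n, which gives n^3 + n^2 = 3136/8."""
    n = TABLE[3136 // 8]
    return 2 * n


def solve_system_b():
    """Solve xyz + xy = 7/6 with y = 2x/3 and z = 12x.

    Substituting gives 8x^3 + (2/3)x^2 = 7/6; multiplying by 216 turns this
    into (12x)^3 + (12x)^2 = 252, which is a table entry.
    """
    n = TABLE[252]
    x = Fraction(n, 12)
    return x, 2 * x / 3, 12 * x


print(solve_cubic_a(), solve_system_b())
```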

1.1.15 It is known that the infinite series obtained by expanding $(a^2 + h)^{1/2}$
by the process of the binomial theorem converges to $(a^2 + h)^{1/2}$ if $-a^2 < h < a^2$.

(a)  Establish  the  approximation  formula 

$$(a^2 + h)^{1/2} \approx a + \frac{h}{2a}, \qquad 0 < h < a^2.$$

(b)  Take  a  =  4/3  and  h  =  2/9  in  the  approximation  formula  of  part  (a), 
and thus find a Babylonian rational approximation for $\sqrt{2}$. Find a rational
approximation for $\sqrt{5}$ by taking a = 2, h = 1.
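
Keeping only the first two terms of the binomial expansion gives (a^2 + h)^(1/2) ≈ a + h/(2a). The sketch below evaluates this in exact fractions; the function name `two_term_sqrt` is our own, and the code is meant only as a check on hand computation.

```python
from fractions import Fraction


def two_term_sqrt(a, h) -> Fraction:
    """First two terms of the binomial expansion of (a^2 + h)^(1/2)."""
    a, h = Fraction(a), Fraction(h)
    return a + h / (2 * a)


approx_root_2 = two_term_sqrt(Fraction(4, 3), Fraction(2, 9))  # a^2 + h = 2
approx_root_5 = two_term_sqrt(2, 1)                            # a^2 + h = 5
print(approx_root_2, approx_root_5)
```

Squaring each result and comparing with 2 or 5 shows how much the dropped terms cost.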

1.1.16 The Hindu mathematician, Aryabhata, wrote early in the sixth
century a.d. His work is a poem of 33 couplets called the Ganita. Following
are  translations  of  two  of  the  couplets:  (1)  The  area  of  a  triangle  is  the 
product  of  the  altitude  and  half  the  base;  half  of  the  product  of  this  area 
and the height is the volume of the solid of six edges. (2) Half the circumference
multiplied by half the diameter gives the area of the circle; this area
multiplied  by  its  own  square  root  gives  the  volume  of  the  sphere.  Show  that, 
in  each  of  these  couplets,  Aryabhata  is  correct  in  two  dimensions  but  wrong 
in  three. 

1.1.17  An  early  Chinese  work,  dating  probably  from  the  second  century 
b.c. and which had considerable influence on the development of mathematics
in China, was the K’ui-ch’ang Suan-shu, or Arithmetic in Nine Sections.
In this work we find the empirical formula s(c + s)/2 for the area of a circular
segment  of  chord  c  and  depth  s.  Obtain  a  correct  formula  in  terms  of  these 
quantities.

1.1.18 (a) Devise an empirical procedure, using templates and a balance,
for  showing  that  the  area  under  one  arch  of  the  cycloid  curve  is  equal 
to  three  times  the  area  of  the  generating  circle.  (An  experiment  of  this  nature 
was performed by Galileo in 1599. The first published mathematical demonstration
that the area of a cycloidal arch is exactly three times that of the
generating  circle  was  furnished  in  1644,  by  Galileo’s  pupil,  Evangelista 
Torricelli.) 

(b)  Devise  an  empirical  procedure,  using  a  right  circular  cone,  a  right 
circular  cylinder  of  the  same  radius  and  altitude,  and  some  sand,  for  showing 
that  the  volume  of  a  right  circular  cone  is  one  third  the  product  of  its 
altitude  and  the  area  of  its  base. 

(c)  Devise  an  empirical  procedure,  using  a  circular  disc,  a  hemisphere 
of  the  same  radius,  and  a  long  piece  of  thick  cord,  for  showing  that  the 
area  of  a  sphere  is  equal  to  four  times  that  of  a  great  circle. 

(d)  Show  empirically,  by  folding  paper,  that  the  sum  of  the  angles  of 
a  triangle  is  equal  to  a  straight  angle. 

1.2.1  Criticize  the  following  inductions: 

(a)  Mr.  Smith  and  Mr.  Brown  were  both  born  in  January,  and  both 
suffer  from  colds.  They  resigned  themselves  to  their  fate  on  the  ground  that 
all  people  born  in  January  must  suffer  from  colds. 

(b)  John  had  never  eaten  yeast,  and  at  the  beginning  of  the  year 
weighed  120  pounds.  For  the  next  six  months  he  ate  three  yeast  cakes  a  day, 
and  at  the  end  of  that  time  weighed  150  pounds.  Therefore  eating  yeast 
makes  people  gain  weight. 

1.2.2  Criticize  the  following  inductions: 

(a)  During  a  certain  summer  someone  noted  that  the  number  of  pounds 
of  butter  sold  in  New  York  City  each  month  varied  more  or  less  directly 
with  the  number  of  inches  of  rainfall  in  New  York  City  each  month,  and 
conjectured  that  there  must  be  some  connection  between  the  two. 

(b)  During  another  summer  a  high  degree  of  correlation  was  observed 
between the number of people each day at a beach resort and the corresponding
number of people each day taking a boat ride on a river leading
from  a  large  city  to  the  beach  resort.  From  this  correlation  it  was  induced 
that  many  people  travel  to  the  beach  by  boat. 

(c)  From  an  observation  that  students  who  made  high  grades  in  English 
also  generally  made  high  grades  in  mathematics  it  was  induced  that  English 
helps  mathematics. 

(d)  Statistics  show  that  over  the  years  our  principal  roads  have  been 
made  wider  and  wider,  and  at  the  same  time  accidents  have  increased. 
Apparently  wide  roads  (perhaps  because  they  cause  people  to  drive  faster) 
are  a  cause  of  accidents. 

1.2.3  The  three  altitudes  of  a  triangle  are  concurrent.  Would  you  expect 
the four altitudes of a tetrahedron to be concurrent? (Many theorems concerning
tetrahedra were first suggested by the corresponding theorems about
triangles. In this case, however, the analogy leads to an incorrect result.
Only for the so-called orthocentric tetrahedra are the four altitudes concurrent.
An orthocentric tetrahedron is a tetrahedron each edge of which is perpendicular
to the opposite edge.)

1.2.4  Two  lines  through  the  vertex  of  an  angle  and  symmetrical  with 
respect  to  the  bisector  of  the  angle  are  called  a  pair  of  isogonal  conjugate  lines 
of  the  angle.  There  is  an  attractive  theorem  about  triangles  which  states 
that  if  three  lines  through  the  vertices  of  a  triangle  are  concurrent,  then  the 
three isogonal conjugate lines through the vertices of the triangle are also
concurrent.  Try  to  construct  an  analogous  definition  and  theorem  for  the 
tetrahedron. 

1.2.5  List  from  the  following  statements  those  that  are  equivalent  to  the 
statement,  “All  parallelograms  are  quadrilaterals”: 

(a)  Every  parallelogram  is  a  quadrilateral. 

(b)  If  a  figure  is  a  quadrilateral,  then  it  must  be  a  parallelogram. 

(c)  If  a  figure  is  not  a  quadrilateral,  then  it  is  not  a  parallelogram. 

(d)  If  a  figure  is  a  parallelogram,  then  it  surely  is  not  a  quadrilateral. 

1.2.6  List  the  following  statements  which  are  equivalent  to  the  statement, 
“When  the  sunset  is  red,  it  is  sure  to  rain  the  next  day”: 

(a) If it is raining today, then the sunset last evening must have been
red.

(b)  If  it  does  not  rain  today,  then  the  sunset  last  evening  must  have 
been  red. 

(c)  If  it  does  not  rain  today,  then  the  sunset  last  evening  must  not 
have  been  red. 

(d)  Whenever  it  rains  during  the  day,  the  sunset  of  the  previous  evening 
was  red. 

1.2.7  List  the  following  statements  which  are  equivalent  to  the  statement, 
“It  never  rains  in  June”: 

(a)  If  it  is  June,  it  is  not  raining. 

(b)  If  it  is  not  raining,  it  is  not  June. 

(c)  In  June  it  never  rains. 

(d)  Never  in  June  does  it  rain. 

(e)  If  it  is  raining,  it  is  not  June. 

(f)  Sometimes  in  June  it  does  not  rain. 

1.2.8  Draw  diagrams  illustrating  each  of  the  following  types  of  categorical 
propositions: 

(a) Universal affirmative: All a are b.

(b) Universal negative: No a are b.

(c) Particular affirmative: Some a are b.

(d) Particular negative: Some a are not b.

1.2.9  Test  the  following  arguments  for  validity: 

(a) Premise: All x are y.

Conclusion: All non-x are non-y.

(b) Premises: (1) All games played in the street are dangerous.

(2) No bull fighting is played in the street.

Conclusion: Bull fighting is not a dangerous game.

(c) Premises: (1) No x are y.

(2) Some x are z.

Conclusion: Some z are not y.

(d) Premises: (1) All trapezoids are quadrilaterals.

(2) All parallelograms are quadrilaterals.

Conclusion: All parallelograms are trapezoids.

(e) Premises: (1) All useful books are amusing.

(2) All books of tables are useful books.

Conclusion: All books of tables are amusing.

(f) Premise: All knowledge is useful.

Conclusion: No knowledge is useless.

(g) Premises: (1) Some doctors are not paid enough.

(2) Some doctors are college professors.

Conclusion: Some college professors are not paid enough.

(h) A student must study to deserve good grades. John studied. Therefore
he deserves good grades.

(i)  In  a  certain  triangle  the  sum  of  the  squares  on  two  sides  equals  the 
square  on  the  third.  Hence  the  triangle  is  a  right  triangle  by  the  Pythagorean 
theorem. 

(j)  “He  that  is  of  God  heareth  God’s  words;  ye  therefore  hear  them 
not,  because  ye  are  not  of  God.”  (John  8:47.) 

(k) “I have tasted eggs, certainly,” said Alice . . . “but little girls eat
eggs  quite  as  much  as  serpents  do,  you  know.” 

“I  don’t  believe  it,”  said  the  Pigeon;  “but  if  they  do,  why,  then  they’re 
a  kind  of  serpent:  that’s  all  I  can  say.”  (Lewis  Carroll,  Alice  in  Wonderland.) 

1.2.10 Let T stand for true, F for false, V for valid, and I for invalid. Try
to construct simple arguments satisfying each of the following possibilities:

Premises    T  T  T  T  F  F  F  F

Argument    V  V  I  I  V  V  I  I

Conclusion  T  F  T  F  T  F  T  F

1.2.11 Consider the following four statements, called, respectively, the
direct statement, the converse statement, the inverse statement, and the contrapositive
statement: (1) All a are b, (2) all b are a, (3) all non-a are non-b,
(4) all non-b are non-a.

(a)  Show  that  the  direct  and  contrapositive  statements  are  equivalent. 

(b)  Show  that  the  converse  and  inverse  statements  are  equivalent. 

(c) Taking “All parallelograms are quadrilaterals” as the direct statement,
state the converse, inverse, and contrapositive statements.

1.2.12 (a) The categorical statement, “All a are b,” may be stated in the
equivalent hypothetical form, “If w is an a, then w is a b.” State the corresponding
converse, inverse, and contrapositive statements in hypothetical
form. 

(b)  State  the  converse,  inverse,  and  contrapositive  of  “If  a  triangle  is 
isosceles, then the bisectors of its base angles are equal.” (The direct proposition
is very easily established. The converse proposition is known as the
Steiner-Lehmus theorem and is troublesome to establish. If one can manage to
establish the inverse proposition, then, of course, by Problem 1.2.11 (b), the
Steiner-Lehmus theorem will follow.)

1.3.1  We  are  told  that  Thales  measured  the  distance  of  a  ship  from  shore 
by  using  the  fact  that  two  triangles  are  congruent  if  two  angles  and  the 
included  side  of  one  are  equal  to  two  angles  and  the  included  side  of  the 
other. The historian, Thomas L. Heath, has conjectured that this computation
was probably made by an instrument consisting of two rods AC and AD,
hinged together at A. The rod AD was held vertically over a point B on
shore, while rod AC was pointed toward the ship P. Then, without changing
the angle DAC, the instrument was revolved about AD, and point Q noted
on the ground at which arm AC was directed. What distance must be
measured  in  order  to  find  the  distance  from  B  to  the  inaccessible  point  P? 

1.3.2  The  Eudemian  Summary  says  that  in  Pythagoras’  time  there  were  three 
means, the arithmetic, the geometric, and the subcontrary, the last name being
later changed to harmonic by Archytas and Hippasus. We may define these
three means of two positive numbers a and b as

$$A = (a + b)/2, \qquad G = \sqrt{ab}, \qquad H = 2ab/(a + b),$$

respectively.

(a) Show that $A \geq G \geq H$, equality holding if and only if a = b.

(b) Show that H is the harmonic mean between a and b if there exists
a number n such that $a = H + a/n$ and $H = b + b/n$. This was the Pythagorean
definition of the harmonic mean of a and b.

(c)  Since  8  is  the  harmonic  mean  of  12  and  6,  Philolaus,  a  Pythagorean 
of  about  425  b.c.,  called  the  cube  a  “geometrical  harmony.”  Explain  this. 
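
The three means are easy to tabulate for particular numbers before attempting the proofs. The sketch below is ours (the function name `three_means` is hypothetical) and computes the arithmetic mean (a + b)/2, the geometric mean sqrt(ab), and the harmonic mean 2ab/(a + b) for the pair 12 and 6 mentioned in part (c).

```python
import math


def three_means(a: float, b: float):
    """Return the arithmetic, geometric, and harmonic means of positive a and b."""
    arithmetic = (a + b) / 2
    geometric = math.sqrt(a * b)
    harmonic = 2 * a * b / (a + b)
    return arithmetic, geometric, harmonic


A, G, H = three_means(12, 6)
print(A, G, H)
```

A numerical instance of this kind illustrates the ordering in part (a) but does not, of course, prove it.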

1.3.3  Tradition  is  unanimous  in  ascribing  to  Pythagoras  the  independent 
discovery  of  the  theorem  on  the  right  triangle  which  now  universally  bears 
his  name  (the  square  on  the  hypotenuse  of  a  right  triangle  is  equal  to  the  sum 
of  the  squares  on  the  two  legs).  This  theorem  was  known  to  the  Babylonians 
of  Hammurabi’s  time,  more  than  a  thousand  years  earlier,  but  the  first 
general  proof  of  the  theorem  may  well  have  been  given  by  Pythagoras.  There 
has been considerable conjecture regarding the proof Pythagoras might have
offered; the common belief is that it probably was a dissection type of proof
such as is suggested by Figure 8. Supply the proof. (To prove that the central
piece of the second dissection is actually a square of side c we need to employ
the fact that the sum of the angles of a right triangle is equal to two right
angles. But the Eudemian Summary attributes this theorem for the general
triangle to the Pythagoreans. Since a proof of this theorem requires, in turn,
a knowledge of some properties of parallels, the early Pythagoreans are also
credited with the development of that theory.)

FIG. 8 (dissection diagram; side labels b, a, a, b)

1.3.4  State  and  prove  the  converse  of  the  Pythagorean  theorem. 

1.3.5  Closely  allied  to  the  Pythagorean  theorem  is  the  problem  of  finding 
integers a, b, c to represent the legs and hypotenuse of a right triangle. Such
a triple of numbers is known as a Pythagorean triple, and there is fairly convincing
evidence that the ancient Babylonians knew how to calculate such triples.

(a) Show that, for any odd integer m, the three numbers $m$, $(m^2 - 1)/2$,
and $(m^2 + 1)/2$ yield a Pythagorean triple. (The Pythagoreans have been
credited  with  this  discovery.) 

(b) Show that for any integer m, the three numbers $2m$, $m^2 - 1$, and
$m^2 + 1$ yield a Pythagorean triple. (This slight generalization of the above
result is attributed to Plato. Neither formula yields all Pythagorean triples.
Not  until  Euclid  wrote  his  Elements  do  we  find  a  complete  solution  of  the 
problem.) 
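
Both formulas can be tried out mechanically before they are proved. In the sketch below, the function names `pythagorean_form`, `platonic_form`, and `is_triple` are our own labels, and the restrictions to odd m ≥ 3 and to m ≥ 2 are assumptions we add so that every entry is a positive integer.

```python
def pythagorean_form(m: int):
    """Triple m, (m^2 - 1)/2, (m^2 + 1)/2 for odd m >= 3."""
    if m < 3 or m % 2 == 0:
        raise ValueError("m must be an odd integer >= 3")
    return m, (m * m - 1) // 2, (m * m + 1) // 2


def platonic_form(m: int):
    """Triple 2m, m^2 - 1, m^2 + 1 for m >= 2."""
    if m < 2:
        raise ValueError("m must be an integer >= 2")
    return 2 * m, m * m - 1, m * m + 1


def is_triple(a: int, b: int, c: int) -> bool:
    """True when a and b are legs and c the hypotenuse of a right triangle."""
    return a * a + b * b == c * c


for m in (3, 5, 7):
    print(pythagorean_form(m), platonic_form(m))
```

A finite search of this kind is evidence, not proof; the algebraic identity behind each formula is what the problem asks for.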

1.3.6  Draw  three  unequal  line  segments.  Label  the  longest  one  a,  the 
medium one b, and take the shortest one as 1 unit. With straightedge and
compasses construct line segments of lengths

(a) $a + b$ and $a - b$,

(b) $ab$,

(c) $a/b$,

(d) $\sqrt{a}$,

(e)  a/n,  n  a  positive  integer. 

1.3.7  Show  that  there  can  be  no  more  than  five  regular  polyhedra. 

1.4.1 Explain how the commonly heard statement, “An axiom is a self-evident
truth,” reflects part of the pattern of material axiomatics.

1.4.2  As  a  simple  example  of  a  discourse  conducted  by  material  axiomatics, 
consider  a  certain  (finite  and  nonempty)  collection  S  of  people  and  certain 
clubs  formed  among  these  people,  a  club  being  a  (nonempty)  set  of  people 
organized  for  some  common  purpose.  Our  basic  terms  are  thus  the  collection 
S  of  people  and  the  clubs  to  which  these  people  belong.  About  these  people 
and  their  clubs  we  assume: 

Postulate  1.  Every  person  of  S  is  a  member  of  at  least  one  club. 

Postulate  2.  For  every  pair  of  people  of  S  there  is  one  and  only  one  club  to 
which  both  belong. 

Definition.  Two  clubs  having  no  members  in  common  are  called 
conjugate  clubs. 

Postulate  3.  For  every  club  there  is  one  and  only  one  conjugate  club. 

From  these  postulates  deduce  the  following  theorems: 

Theorem  1.  Every  person  of  S  is  a  member  of  at  least  two  clubs. 

Theorem 2. Every club contains at least two members.

Theorem 3. S contains at least four people.

Theorem  4.  There  exist  at  least  six  clubs. 

Theorem  5.  No  club  contains  more  than  two  members. 

1.4.3 Using the same basic terms as in Problem 1.4.2, let us assume:

Postulate 1. Any two distinct clubs have one and only one member in common.

Postulate 2. Every person of S belongs to two and only two clubs.

Postulate 3. There are exactly four clubs.

From  these  postulates  deduce  the  following  theorems: 

Theorem  1.  There  are  exactly  six  people  in  S. 

Theorem  2.  There  are  exactly  three  people  in  each  club. 

Theorem  3.  For  each  person  in  S  there  is  exactly  one  other  person  in  S  not  in 
the  same  club. 

1.4.4  Establish  the  theorem  about  the  game  of  tic-tac-toe  cited  in  the  text. 

1.4.5  The  cigar  game  is  played  by  two  players  on  a  rectangular  table  top 
with  a  large  stock  of  cigars.  The  two  players,  taking  turns,  lay  (at  each  turn) 
a cigar on the table top so that it does not overlap any other cigar nor protrude
over the edge of the table top. The last player able to place a cigar
on the table top wins the game. Prove the following theorem about this game:
With proper strategy, the player who starts the game can win the game.

1.5.1 (a) Prove that the straight line through the points (0,0) and $(1, \sqrt{2})$
passes through no point, other than (0,0), of the coordinate lattice.

(b) Show how the coordinate lattice may be used for finding rational
approximations of $\sqrt{2}$.

1.5.2 If p is a prime number and n is an integer greater than 1, show that
$\sqrt[n]{p}$ is irrational.

1.5.3 Give a purely geometric proof of the irrationality of $\sqrt{2}$.

1.5.4 The most important of Heron’s geometrical works is his Metrica,
discovered in Constantinople by R. Schöne as recently as 1896. In this work
is found Heron’s method of approximating the square root of a nonsquare
integer, a process frequently used by computers today. If n = ab, then $\sqrt{n}$
is approximated by (a + b)/2, the approximation improving with the closeness
of a to b. The method permits of successive approximations. Thus, if $a_1$
is a first approximation to $\sqrt{n}$, then $a_2 = (a_1 + n/a_1)/2$ is a better approximation,
and $a_3 = (a_2 + n/a_2)/2$ is still better, and so on. Approximate successively,
by Heron’s method, $\sqrt{3}$ and $\sqrt{720}$.
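
Heron's iteration, in which each approximation a is replaced by the average of a and n/a, translates directly into a short loop. The following is a minimal sketch; the function name `heron_sqrt` and the starting values 2 and 27 are our own choices. Exact fractions are used so that each step can be compared with hand computation.

```python
from fractions import Fraction


def heron_sqrt(n, first, steps):
    """Return the list of successive Heron approximations to sqrt(n)."""
    a = Fraction(first)
    approximations = [a]
    for _ in range(steps):
        # Average the current estimate with n divided by the estimate.
        a = (a + Fraction(n) / a) / 2
        approximations.append(a)
    return approximations


root_3 = heron_sqrt(3, 2, 2)       # a1 = 2, then a2 and a3
root_720 = heron_sqrt(720, 27, 3)  # a1 = 27, then a2, a3, a4
print(root_3, float(root_720[-1]))
```

The number of correct digits roughly doubles at each step, which is why so few iterations are needed.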

1.5.5  In  some  problems  in  the  Heronian  collection  appear  the  formulas 

$$a, b = \frac{(r + s) \pm \{(r + s)^2 - 8rs\}^{1/2}}{2},$$


for  the  legs  a  and  b  of  a  right  triangle  of  perimeter  2s  and  inradius  r.  Obtain 
these  formulas. 
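
Before deriving the formulas it may help to test them on familiar right triangles. The sketch below assumes the reading a, b = [(r + s) ± sqrt((r + s)^2 - 8rs)]/2, with s the semiperimeter and r the inradius; the function name `legs_from_inradius` is hypothetical.

```python
import math


def legs_from_inradius(r: float, s: float):
    """Legs of a right triangle with inradius r and perimeter 2s (longer leg first)."""
    discriminant = (r + s) ** 2 - 8 * r * s
    if discriminant < 0:
        raise ValueError("no right triangle has this inradius and perimeter")
    root = math.sqrt(discriminant)
    return ((r + s) + root) / 2, ((r + s) - root) / 2


# The 3-4-5 triangle has perimeter 12 (s = 6) and inradius 1.
print(legs_from_inradius(1, 6))
# The 5-12-13 triangle has perimeter 30 (s = 15) and inradius 2.
print(legs_from_inradius(2, 15))
```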

1.5.6 (a) In his work Catoptrica, Heron proves, on the assumption that light
travels  by  the  shortest  path,  that  the  angles  of  incidence  and  reflection  in  a 
mirror  are  equal.  Prove  this. 

(b)  A  man  wishes  to  go  from  his  house  to  the  bank  of  a  straight  river 
for  a  pail  of  water,  which  he  will  then  carry  to  his  barn,  on  the  same  side 
of  the  river  as  his  house.  Find  the  point  on  the  riverbank  which  will  minimize 
the  distance  the  man  must  travel. 

1.5.7  A  regular  heptagon  (seven-sided  polygon)  cannot  be  constructed  with 
straightedge and compasses. In his work Metrica, Heron takes, for an approximate
construction, the side of the heptagon equal to the apothem of a regular
hexagon  having  the  same  circumcircle.  How  good  an  approximation  is  this? 

1.5.8 Assuming the equality of alternate interior angles formed by a transversal
cutting a pair of parallel lines, prove the following:

(a) The sum of the angles of a triangle is equal to a straight angle.

(b) The sum of the interior angles of a convex polygon of n sides is
equal to n - 2 straight angles.

1.5.9 Assuming (1) A central angle of a circle is measured by its intercepted
arc, (2) The sum of the angles of a triangle is equal to a straight angle,
(3) The base angles of an isosceles triangle are equal, (4) A tangent to a
circle is perpendicular to the radius drawn to the point of contact, establish
the following chain of theorems:

(a) An exterior angle of a triangle is equal to the sum of the two
remote interior angles.

(b) An inscribed angle in a circle is measured by one half its intercepted
arc.

(c)  An  angle  formed  by  two  intersecting  chords  in  a  circle  is  measured 
by  one  half  the  sum  of  the  two  intercepted  arcs. 

(d)  An  angle  formed  by  two  intersecting  secants  of  a  circle  is  measured 
by  one  half  the  difference  of  the  two  intercepted  arcs. 

(e)  An  angle  formed  by  a  tangent  to  a  circle  and  a  chord  through  the 
point  of  contact  is  measured  by  one  half  the  intercepted  arc. 

(f)  An  angle  formed  by  a  tangent  and  an  intersecting  secant  of  a  circle 
is  measured  by  one  half  the  difference  of  the  two  intercepted  arcs. 

(g)  An  angle  formed  by  two  intersecting  tangents  of  a  circle  is  measured 
by  one  half  the  difference  of  the  two  intercepted  arcs. 

1.5.10  Assuming  the  area  of  a  rectangle  is  given  by  the  product  of  its  two 
dimensions,  establish  the  following  chain  of  theorems: 

(a)  The  area  of  a  parallelogram  is  equal  to  the  product  of  its  base 
and  altitude. 

(b)  The  area  of  a  triangle  is  equal  to  half  the  product  of  any  side  and 
the  altitude  on  that  side. 

(c) The area of a right triangle is equal to half the product of its two
legs.

(d) The area of a triangle is equal to half the product of its perimeter
and the radius of its inscribed circle.

(e) The area of a trapezoid is equal to the product of its altitude and
half the sum of its bases.

(f)  The  area  of  a  regular  polygon  is  equal  to  half  the  product  of  its 
perimeter  and  the  radius  of  its  inscribed  circle. 

(g)  The  area  of  a  circle  is  equal  to  half  the  product  of  its  circumference 
and  its  radius. 


CHAPTER TWO

EUCLID’S “ELEMENTS”


2.1 The Importance and Formal Nature of Euclid’s Elements

The  earliest  extensively  developed  example  of  the  use  of  the  axiomatic 
method  that  has  come  down  to  us  is  the  very  remarkable  and  historically 
important  Elements  of  Euclid.  The  production  of  this  treatise  is  generally 
regarded  as  the  first  great  landmark  in  the  history  of  mathematical  thought 
and  organization,  and  its  subsequent  influence  on  scientific  thinking  can 
hardly  be  overstated. 

Of  Euclid  himself,  however,  disappointingly  little  is  known.  It  is  from 
Proclus’ Commentary on Euclid, Book I, that we obtain our most satisfying
information  about  Euclid.  He  writes, 

Euclid, who put together the Elements, collected many of the theorems of
Eudoxus. He perfected many of the theorems of Theaetetus, and also brought
to irrefragable demonstration the things which were only somewhat loosely
proved by his predecessors. This man lived in the time of the first Ptolemy, for
Archimedes, who came immediately after the first Ptolemy, makes mention of
Euclid, and furthermore, it is said that Ptolemy once asked him if there was in
geometry any shorter way than that of the Elements, and Euclid answered that
there was no royal road to geometry. It is evident, then, that Euclid came after
the time of Plato, but preceded Eratosthenes and Archimedes.*

This statement would imply that Euclid lived about 300 B.C. Also, from
other evidence, it seems quite certain that Euclid was the first professor of
mathematics at the famous University of Alexandria,† and that he was the
founder of the distinguished and long-lived Alexandrian School of Mathematics.
Even his birthplace is not known, but there is some reason to believe
that he received his mathematical training in the Platonic School at Athens.

Although  Euclid  wrote  at  least  ten  treatises  on  mathematics,  posterity 
has come to know him chiefly through his Elements, a monumental work
written  in  thirteen  books,  or  parts.  This  extraordinary  work  so  quickly  and 
so  completely  superseded  all  previous  works  of  the  same  nature  that  now  no 
copies  remain  of  the  earlier  efforts.  Apparently  from  its  very  first  appearance 
it  was  accorded  the  highest  respect,  and  the  mere  citation  of  Euclid’s  book 
and  proposition  numbers  has  been  regarded  ever  since  as  sufficient  to  identify 
a  particular  theorem  or  construction.  With  the  single  exception  of  the  Bible, 
no  work  has  been  more  widely  studied  or  edited.  For  more  than  two  millennia 
it  has  dominated  all  teaching  of  geometry,  and  over  a  thousand  editions  of 
it  have  appeared  since  the  first  one  printed  in  1482.  And,  as  the  prototype 
of  the  axiomatic  or  postulational  method,  its  impact  on  the  development  of 
mathematics  has  been  enormous. 

Proclus  has  clarified  for  us  the  meaning  of  the  term  “elements.”  It 
seems  that  the  elements  of  any  demonstrative  study  are  to  be  regarded  as 
the  leading,  or  key,  theorems  which  are  of  wide  and  general  use  in  the 
subject.  Their  function  has  been  compared  to  that  of  the  letters  of  the 
alphabet  in  relation  to  language;  as  a  matter  of  fact,  letters  are  called  by  the 
same  name  in  Greek.  The  selection  of  the  theorems  to  be  taken  as  the 
elements  of  the  subject  requires  the  exercise  of  considerable  judgment. 
As  Proclus  says, 

Now  it  is  difficult,  in  each  science,  both  to  select  and  arrange  in  due  order 
the  elements  from  which  all  the  rest  is  resolved.  And  of  those  who  have  made  the 
attempt  some  were  able  to  put  together  more  and  some  less;  some  used  shorter 
proofs;  some  extended  their  investigations  to  an  indefinite  length;  some  avoided 
the  method  of  reductio  ad  absurdum;  some  avoided  proportion;  some  contrived 
preliminary  steps  directed  against  those  who  reject  the  principles;  and,  in  a  word, 
many  different  methods  have  been  invented  by  various  writers  of  elements. 

It is essential that such a treatise should be rid of everything superfluous
(for this is an obstacle to the acquisition of knowledge); it should select everything
that embraces the subject and brings it to a point (for this is of supreme service
to science); it must have great regard at once to clearness and conciseness (for
their opposites trouble our understanding); it must aim at the embracing of
theorems in general terms (for the piecemeal division of instruction into the more
partial makes knowledge difficult to grasp). In all these ways Euclid’s system of
elements will be found to be superior to the rest.

*The quotations from Proclus and Aristotle which appear in this and the next chapter
are adapted, by permission, from T. L. Heath, The Thirteen Books of Euclid’s Elements
(New York: Cambridge University Press, 1926), pp. 1, 115, 116, 117-118, 119,
121-122, 153-155, 202-203, 241-242.

†For an interesting exposition upon Alexandria, see R. E. Langer (Bibliography).

And,  elsewhere,  Proclus  says, 

Starting  from  these  elements,  we  shall  be  able  to  acquire  knowledge  of  the 
other  parts  of  this  science  as  well,  while  without  them  it  is  impossible  for  us  to  get 
a  grasp  of  so  complex  a  subject,  and  knowledge  of  the  rest  is  unattainable.  As  it 
is,  the  theorems  which  are  most  of  the  nature  of  principles,  most  simple,  and  most 
akin  to  the  first  hypotheses  are  here  collected,  in  their  appropriate  order;  and  the 
proofs  of  all  other  propositions  use  these  theorems  as  thoroughly  known,  and  start 
from  them.  Thus  Archimedes  in  the  books  on  the  sphere  and  cylinder,  Apollonius, 
and  all  other  geometers,  clearly  use  the  theorems  proved  in  this  very  treatise  as 
constituting  admitted  principles. 

Aristotle, in his Metaphysics, speaks of “elements” in the same sense when
he  says,  “Among  geometrical  propositions  we  call  those  ‘elements’  the  proofs 
of  which  are  contained  in  the  proofs  of  all  or  most  of  such  propositions.” 

It  is  no  reflection  upon  the  brilliance  of  Euclid’s  work  that  there  had 
been other Elements anterior to his own. According to the Eudemian Summary,
Hippocrates  of  Chios  made  the  first  effort  along  this  line,  and  the  next 
attempt  was  that  of  Leon,  who  in  age  fell  somewhere  between  Plato  and 
Eudoxus.  It  is  said  that  Leon’s  work  contained  a  more  careful  selection  of 
propositions  than  did  that  of  Hippocrates,  and  that  these  propositions  were 
more  numerous  and  more  serviceable.  The  textbook  of  Plato’s  Academy  was 
written  by  Theudius  of  Magnesia  and  was  praised  as  an  admirable  collection 
of  elements.  The  geometry  of  Theudius  seems  to  have  been  the  immediate 
precursor of Euclid’s work and was undoubtedly available to Euclid, especially
if he studied in the Platonic School. Euclid was acquainted also with
the  important  work  of  Theaetetus  and  Eudoxus.  Thus  it  is  probable  that 
Euclid’s  Elements  is,  for  the  most  part,  a  highly  successful  compilation  and 
systematic  arrangement  of  works  of  earlier  writers.  No  doubt  Euclid  had  to 
supply  a  number  of  the  proofs  and  to  perfect  many  others,  but  the  chief  merit 
of  his  work  lies  in  the  skillful  selection  of  the  propositions  and  in  their 
arrangement  into  a  logical  sequence  presumably  following  from  a  small 
handful  of  initial  assumptions. 

In  the  thirteen  books  that  comprise  Euclid’s  Elements  there  is  a  total  of 
465  propositions.  Contrary  to  popular  impression,  many  of  these  propositions 
are  concerned,  not  with  geometry,  but  with  number  theory  and  with 
elementary  (geometric)  algebra.  Book  I  contains  the  necessary  preliminary 
material, together with theorems on congruence, parallel lines, and rectilinear
figures. Book II is devoted to geometric algebra, Book III to circles,
and Book IV to the construction of regular polygons. Books V and VI
contain  the  Eudoxian  theory  of  proportion  and  its  application  to  geometry. 
Books  VII,  VIII,  and  IX,  containing  a  total  of  102  propositions,  deal  with 
elementary  number  theory.  Book  X  is  devoted  to  the  study  of  irrationals, 
much  of  the  material  probably  from  Theaetetus.  The  remaining  three  books 
are  concerned  with  solid  geometry.  The  material  of  Books  I,  II,  and  IV 
was,  in  all  likelihood,  developed  by  the  early  Pythagoreans.  The  material 
found  in  current  American  high  school  plane  and  solid  geometry  texts  is 
largely  that  found  in  Euclid’s  Books  I,  III,  IV,  VI,  XI,  and  XII. 

Certainly  there  is  a  good  deal  in  the  contents  of  Euclid’s  Elements  which 
is  of  considerable  interest,  but  in  the  present  study  our  concern  is  with  the 
formal  nature  of  the  Elements  rather  than  with  its  mathematical  contents. 
In  fact,  the  various  consequences  of  the  formal  character  of  this  great  work 
will  constitute  some  of  our  chief  avenues  of  investigation.  At  the  moment, 
we  are  especially  interested  in  Euclid’s  conception  of  the  axiomatic  method 
and in the precise manner in which he applied the method to the development
of his Elements. We consider these matters in the two following sections.


2.2 Aristotle and Proclus on the Axiomatic Method

It  is  a  misfortune  that  no  copy  of  Euclid’s  Elements  has  been  found  which 
actually  dates  from  the  author’s  own  time.  Modern  editions  of  the  work  are 
based  upon  a  revision  that  was  prepared  by  the  Greek  commentator  Theon 
of  Alexandria,  who  lived  almost  700  years  after  the  time  of  Euclid.  Theon’s 
revision  was,  until  the  early  nineteenth  century,  the  oldest  edition  of  the 
Elements  known  to  us.  In  1808,  however,  when  Napoleon  ordered  valuable 
manuscripts  to  be  taken  from  Italian  libraries  and  to  be  sent  to  Paris, 
F.  Peyrard  found,  in  the  Vatican  library,  a  tenth-century  copy  of  an  edition 
of  Euclid’s  Elements  which  predates  Theon’s  recension.  A  study  of  this  older 
edition and a careful sifting of citations and remarks made by early commentators
indicate that the introductory material of Euclid’s original treatise
undoubtedly  underwent  some  editing  in  the  subsequent  revisions,  but  that 
the  propositions  and  their  proofs,  except  for  minor  additions  and  deletions, 
have  remained  essentially  as  Euclid  wrote  them. 

Because  of  our  lack  of  a  copy  of  Euclid’s  original  treatise,  and  because 
of  the  changes  and  additions  made  by  later  editors,  it  is  not  certain  precisely 
what  statements  Euclid  assumed  at  the  start  of  his  work,  nor  even  how  many 
such  statements  he  had.  Also,  unfortunately,  there  is  no  known  commentary 
by  Euclid  himself  upon  the  nature  of  the  deductive  organization  used  so 
successfully in his mathematical studies. It would be valuable to have Euclid’s
own point of view upon the meaning of proof, or upon the significance
which he attached to such terms as “definition,” “axiom,” and “postulate.”
Even partially to understand Euclid, therefore, we must study the ideas held
by Euclid’s contemporaries. Aristotle, in particular, is an important source
of  information.  Since  Aristotle  studied  at  Plato’s  Academy,  his  scholastic 
background  may  have  been  quite  similar  to  that  of  Euclid. 

A student of mathematics would do well to study Aristotle’s Analytica
posteriora. The following passage from that work is particularly full and
enlightening: 

By  the  first  principles  of  a  subject  I  mean  those  the  truth  of  which  it  is  not 
possible  to  prove.  What  is  denoted  by  the  first  terms  and  those  derived  from  them 
is assumed, but, as regards their existence, this must be assumed for the principles
but proved for the rest. Thus what a unit is, what a straight line is, or what a
triangle is, must be assumed; and the existence of the unit and of magnitude
must also be assumed, but the existence of the rest must be proved. Now of the
premises used in demonstrative sciences some are peculiar to each science and
others common to all, the latter being common by analogy, for of course they are
actually useful insofar as they are applied to the subject-matter included under
the particular science. Instances of first principles peculiar to a science are the
assumptions that a line is of such and such a character, and similarly for a straight
line; whereas it is a common principle, for instance, that if equals be subtracted
from equals, the remainders are equal. But it is enough that each of the common
principles is true as regards the particular subject-matter; in geometry, for
instance, the effect will be the same even if the common principles be assumed
to be true, not of everything, but only of magnitudes, and, in arithmetic, of
numbers.

Now  the  things  peculiar  to  the  science,  the  existence  of  which  must  be 
assumed,  are  the  things  with  reference  to  which  the  science  investigates  the 
essential  attributes,  for  example  arithmetic  with  reference  to  units,  and  geometry 
with  reference  to  points  and  lines.  With  these  things  it  is  assumed  that  they  exist 
and  that  they  are  of  such  and  such  a  nature.  But,  with  regard  to  their  essential 
properties,  what  is  assumed  is  only  the  meaning  of  each  term  employed;  thus 
arithmetic  assumes  the  answer  to  the  question  what  is  meant  by  “odd”  or  “even,” 
a square or a cube, and geometry to the question what is meant by “the
irrational,” or deflection, or the so-called “verging” to a point; but that there
are such things is proved by means of the common principles and of what has
already been demonstrated. It is similar with astronomy. For every demonstrative
science has to do with three things, (1) the things which are assumed to
exist,  namely  the  subject-matter  in  each  case,  the  essential  properties  of  which 
the  science  investigates,  (2)  the  so-called  common  axioms,  which  are  the  primary 
source  of  demonstration,  and  (3)  the  properties,  with  regard  to  which  all  that 
is  assumed  is  the  meaning  of  the  respective  terms  used. 

This  remarkable  passage  is  almost  modern  in  its  point  of  view.  It  says 
that  a  demonstrative  science  must  start  from  a  set  of  assumptions,  known  as 
the  first  principles  of  the  subject.  These  first  principles  constitute  a  sort  of 
platform  of  initial  agreement  from  which  the  rest  of  the  discourse  can  be 
launched by purely deductive procedures. Of these principles, according to
Aristotle, some are common to all sciences and others are peculiar to the
particular  science  being  studied.  The  first  principles  common  to  all  sciences 
are  called  axioms  (illustrated  by,  “if  equals  be  subtracted  from  equals,  the 
remainders  are  equal”).  Among  the  first  principles,  or  initial  assumptions, 
peculiar  to  the  science  being  studied,  we  have,  first  of  all,  statements  of  the 
existence  of  the  subject  matter  and  of  the  fundamental  things  whose  properties 
the  science  intends  to  investigate  (for  example,  in  geometry,  we  must  assume 
the  existence  of  “magnitude,”  of  “points,”  and  of  “lines”).  Also  among  the 
first  principles  peculiar  to  the  science  being  studied  we  have  the  connotation 
of  the  technical  terms  employed  in  the  discourse.  That  is,  we  must  accept 
certain  definitions  concerning  manifestations  or  attributes  of  our  subject 
matter  (for  example,  in  geometry,  we  must  assume  what  is  meant  by  “triangle” 
and  by  “irrational”).  These  definitions,  however,  say  nothing  of  the  existence 
of  the  things  defined,  but  must  be  merely  understood.  The  existence  of  only 
the  subject  matter  and  the  fundamental  things  is  assumed;  the  existence  of 
all  other  things  defined  must  be  proved. 

In  addition  to  the  definitions,  one  might  expect  to  find  among  the  first 
principles  which  are  peculiar  to  the  particular  science  being  studied  some 
statements  concerning  properties  or  relationships  of  the  technical  terms  of 
the  discourse.  Certainly,  since  we  cannot  prove  all  the  statements  of  our 
discourse,  we  anticipate  the  need  for  some  such  assumed  statements  for  the 
purpose  of  getting  started.  About  such  assumptions  Aristotle,  again  in  his 
Analytica posteriora, has the following to say:

Now  anything  that  the  teacher  assumes,  though  it  is  matter  of  proof,  without 
proving  it  himself,  is  a  hypothesis  if  the  thing  assumed  is  believed  by  the  learner, 
and  it  is  moreover  a  hypothesis,  not  absolutely,  but  relatively  to  the  particular 
pupil;  but  if  the  same  thing  is  assumed  when  the  learner  either  has  no  opinion 
on the subject or is of contrary opinion, it is a postulate. This is the difference
between a hypothesis and a postulate; for a postulate is that which is rather
contrary  than  otherwise  to  the  opinion  of  the  learner,  or  whatever  is  assumed 
and  used  without  being  proved,  although  matter  for  demonstration.  Now 
definitions  are  not  hypotheses,  for  they  do  not  assert  the  existence  or  non-existence 
of  anything,  while  hypotheses  are  among  propositions.  Definitions  only  require 
to  be  understood;  a  definition  is  therefore  not  a  hypothesis,  unless  indeed  it  be 
asserted  that  any  audible  speech  is  a  hypothesis.  A  hypothesis  is  that  from  the 
truth  of  which,  if  assumed,  a  conclusion  can  be  established. 

It  must  be  admitted  that  Aristotle’s  notion  of  a  postulate  and  of  the 
role  that  a  postulate  plays  in  a  demonstrative  science  is  not  too  clear.  His 
remarks  imply  that  a  postulate  represents  the  assumption  of  a  thing  which  is 
properly a subject of demonstration, and that the assumption is made without,
perhaps, the assent of the student. In other words, a postulate may not
appeal to a person’s sense of what is right, but it has been adopted as basic
in order that the work may proceed. From this point of view, then, a postulate
is a first principle. In contradistinction to this, a hypothesis is an assumption
believed  in  by  the  learner,  and  thus  is  introduced  apparently  in  order  to 
continue  an  argument.  For  example,  once  a  theorem  has  been  established, 
and  hence  is  acceptable  to  the  learner,  that  theorem  may  be  taken  as  a 
hypothesis  from  which  to  deduce  some  later  theorem.  If  we  read  further  in 
the  works  of  Aristotle  we  find  other  passages  which  are  of  special  significance 
in  comprehending  the  organization  of  Euclid’s  Elements.  In  several  places 
we  find  that  Aristotle  regards  an  axiom  as  a  universal  assumption  that  is 
so  self-evident  that  no  sane  person  would  question  it;  also  he  considers  an 
axiom  to  be  too  fundamental  ever  to  be  regarded  as  matter  for  demonstration. 
We  thus  seem  to  have,  according  to  Aristotle,  the  following  four  distinctions 
between  an  axiom  and  a  postulate.  An  axiom  is  common  to  all  sciences, 
whereas  a  postulate  is  related  to  a  particular  science;  an  axiom  is  self-evident, 
whereas  a  postulate  is  not;  an  axiom  cannot  be  regarded  as  a  subject  for 
demonstration,  whereas  a  postulate  is  properly  such  a  subject;  an  axiom  is 
assumed  with  the  ready  assent  of  the  learner,  whereas  a  postulate  is  assumed 
without,  perhaps,  the  assent  of  the  learner.  Some  of  Aristotle’s  statements 
appear  somewhat  contradictory,  but  the  interpretations  just  given  seem 
especially  appropriate  in  any  attempt  to  understand  Euclid’s  work. 

Aristotle’s  characterizations  of  definitions,  axioms,  and  postulates  are 
further  clarified  by  the  following  account  given  by  Proclus  in  his  Commentary 
on Euclid, Book I.*

The  compiler  of  elements  in  geometry  must  give  separately  the  principles 
of  the  science,  and,  after  that,  the  conclusions  from  those  principles,  not  giving 
any  account  of  the  principles  but  only  of  their  consequences.  No  science  proves 
its  own  principles,  or  even  discourses  about  them;  they  are  treated  as  self-evident. 

Thus the first essential was to distinguish the principles from their consequences.
Euclid carries out this plan practically in every book and, as a preliminary
to the whole enquiry, sets out the common principles of this science. Then
he  divides  the  common  principles  themselves  into  definitions,  postulates,  and  axioms. 

For  all  these  are  different  from  one  another;  an  axiom,  a  postulate,  and  a 
definition  are  not  the  same  thing,  as  the  inspired  Aristotle  has  somewhere  pointed 
out.  Whenever  that  which  is  assumed  and  ranked  as  a  principle  is  both  known 
to  the  learner  and  convincing  in  itself,  such  a  thing  is  an  axiom,  for  example  the 
statement  that  things  which  are  equal  to  the  same  thing  are  also  equal  to  one 
another.  When,  on  the  other  hand,  the  pupil  has  not  the  notion  of  what  is  told 
him  which  carries  conviction  in  itself,  but  nevertheless  lays  it  down  and  assents 
to its being assumed, such an assumption is a definition. Thus we do not preconceive
by virtue of a common notion, and without being taught, that the circle
is such and such a figure, but, when we are told so, we assent without demonstration.
When, again, what is asserted is both unknown and assumed even
without the assent of the learner, then, he says, we call this a postulate, for example
that all right angles are equal. This view of a postulate is clearly implied by those
who have made a special and systematic attempt to show, with regard to one of
the postulates, that it cannot be assented to by any one straight off. According
then to the teaching of Aristotle, an axiom, a postulate, and a definition are thus
distinguished.

*We have everywhere corrected a confusion that exists in the original statement
caused by Proclus’ consistent misuse of the term “hypothesis” for the term “definition.”

That  there  was  no  unanimity  of  opinion,  even  among  the  early  Greek 
mathematicians  themselves,  concerning  the  precise  nature  of,  and  the 
difference  between,  an  axiom  and  a  postulate  is  borne  out  by  remarks  made 
by  Proclus.  Proclus  points  out  the  following  three  distinctions  advocated 
by various parties: (1) An axiom is a self-evident assumed statement about
something, and a postulate is a self-evident assumed construction of something;
thus axioms and postulates bear a relation to one another much like
that which exists between theorems and construction problems. (2) An axiom
is an assumption common to all sciences, whereas a postulate is an assumption
peculiar to the particular science being studied. (3) An axiom is an assumption
of something that is both obvious and acceptable to the learner; a
postulate is an assumption of something that is neither necessarily obvious
nor necessarily acceptable to the learner. (This last is essentially the Aristotelian
distinction.) Further confusion is indicated by Proclus when he
points  out  that  some  preferred  to  call  them  all  postulates. 

In  summary,  then,  according  to  the  Greek  conception  of  the  axiomatic 
method,  every  demonstrable  science  must  start  from  assumed  first  principles. 
These  first  principles  consist  of  definitions,  axioms  (or  common  notions),  and 
postulates.  The  definitions  describe  the  technical  terms  used  in  the  discourse, 
and,  except  in  the  case  of  a  few  fundamental  terms,  are  not  meant  to  imply 
the  existence  of  the  entities  described.  The  axioms  and  the  postulates  are 
initial  statements  that  must  be  assumed  in  order  that  the  discourse  may 
proceed.  Just  which  of  these  statements  should  be  called  axioms  and  which 
postulates  was  a  matter  of  varying  opinion. 

2.3 Euclid’s Definitions, Axioms, and Postulates

Adhering  to  the  Greek  conception  of  the  axiomatic  method,  we  find, 
at the very start of Book I of Euclid’s Elements, a list of the definitions,
postulates,  and  common  notions  which  are  to  serve  as  the  first  principles 
of  the  work.  Some  of  the  succeeding  books  of  the  work  commence  with 
additional  lists  of  definitions.  It  is  presumed  by  the  author  that  all  of  the 
465  propositions  included  in  the  treatise  are  logically  deduced  from  these 
principles.  For  reference,  we  now  give  here  the  complete  set  of  first  principles 
for  Book  I  essentially  as  furnished  by  T.  L.  Heath*  in  his  translation  of  the 
distinguished  Heiberg  text  of  Euclid’s  Elements. 


*T. L. Heath [5], 1, 153-155.

Definitions

1.  A  point  is  that  which  has  no  part. 

2.  A  line  is  length  without  breadth. 

3.  The  extremities  of  a  line  are  points. 

4.  A  straight  line  is  a  line  which  lies  evenly  with  the  points  on  itself. 

5.  A  surface  is  that  which  has  only  length  and  breadth. 

6.  The  extremities  of  a  surface  are  lines. 

7.  A  plane  surface  is  a  surface  which  lies  evenly  with  the  straight  lines  on  itself. 

8.  A  plane  angle  is  the  inclination  to  one  another  of  two  lines  in  a  plane  if  the 
lines  meet  and  do  not  lie  in  a  straight  line. 

9.  When  the  lines  containing  the  angle  are  straight  lines,  the  angle  is  called  a 
rectilinear  angle. 

10.  When  a  straight  line  erected  on  a  straight  line  makes  the  adjacent  angles 
equal to one another, each of the equal angles is called a right angle, and the
straight  line  standing  on  the  other  is  called  a  perpendicular  to  that  on  which 
it  stands. 

11.  An  obtuse  angle  is  an  angle  greater  than  a  right  angle. 

12.  An  acute  angle  is  an  angle  less  than  a  right  angle. 

13.  A  boundary  is  that  which  is  an  extremity  of  anything. 

14.  A  figure  is  that  which  is  contained  by  any  boundary  or  boundaries. 

15.  A  circle  is  a  plane  figure  contained  by  one  line  such  that  all  the  straight 
lines  falling  upon  it  from  one  particular  point  among  those  lying  within  the 
figure  are  equal. 

16.  The  particular  point  (of  Definition  15)  is  called  the  center  of  the  circle. 

17.  A  diameter  of  a  circle  is  any  straight  line  drawn  through  the  center  and 
terminated  in  both  directions  by  the  circumference  of  the  circle.  Such  a 
straight  line  also  bisects  the  circle. 

18.  A  semicircle  is  the  figure  contained  by  a  diameter  and  the  circumference  cut 
off  by  it.  The  center  of  the  semicircle  is  the  same  as  that  of  the  circle. 

19.  Rectilinear  figures  are  those  which  are  contained  by  straight  lines,  trilateral 
figures  being  those  contained  by  three,  quadrilateral  those  contained  by  four, 
and  multilateral  those  contained  by  more  than  four  straight  lines. 

20.  Of  the  trilateral  figures,  an  equilateral  triangle  is  one  which  has  its  three  sides 
equal,  an  isosceles  triangle  has  two  of  its  sides  equal,  and  a  scalene  triangle  has 
its  three  sides  unequal. 

21.  Furthermore,  of  the  trilateral  figures,  a  right-angled  triangle  is  one  which  has 
a  right  angle,  an  obtuse-angled  triangle  has  an  obtuse  angle,  and  an  acute-angled 
triangle  has  its  three  angles  acute. 

22.  Of  the  quadrilateral  figures,  a  square  is  one  which  is  both  equilateral  and 
right-angled; an oblong is right-angled but not equilateral; a rhombus is equilateral
but not right-angled; and a rhomboid has its opposite sides and angles
equal to one another but is neither equilateral nor right-angled. Quadrilaterals
other than these are called trapezia.

23.  Parallel  straight  lines  are  straight  lines  which,  being  in  the  same  plane  and 
being  produced  indefinitely  in  both  directions,  do  not  meet  one  another  in 
either direction.

Postulates

Let  the  following  be  postulated: 

1.  A  straight  line  can  be  drawn  from  any  point  to  any  point. 

2.  A  finite  straight  line  can  be  produced  continuously  in  a  straight  line. 

3.  A  circle  may  be  described  with  any  center  and  distance. 

4.  All  right  angles  are  equal  to  one  another. 

5.  If  a  straight  line  falling  on  two  straight  lines  makes  the  interior  angles  on 
the  same  side  together  less  than  two  right  angles,  the  two  straight  lines,  if 
produced  indefinitely,  meet  on  that  side  on  which  the  angles  are  together 
less  than  two  right  angles. 

Common  notions 

1. Things which are equal to the same thing are also equal to one another.

2.  If  equals  be  added  to  equals,  the  wholes  are  equal. 

3.  If  equals  be  subtracted  from  equals,  the  remainders  are  equal. 

4.  Things  which  coincide  with  one  another  are  equal  to  one  another. 

5.  The  whole  is  greater  than  the  part. 
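
In modern symbols, the five common notions might be paraphrased as follows. This is only a sketch under assumptions of our own, none of which is Euclid's notation: the letters $a$, $b$, $c$, $d$ stand for magnitudes of one kind; $+$ and $-$ denote putting together and taking away; $X$ and $Y$ stand for figures, with $m(X)$ a hypothetical symbol for the magnitude of $X$; and the reading of "coincide" as congruence ($\cong$) and of "part" as proper part ($\subsetneq$) is a modern interpretation. The display assumes the amsmath and amssymb packages.

```latex
\begin{align*}
\text{C.N. 1:} \quad & a = c \ \text{and}\ b = c \implies a = b \\
\text{C.N. 2:} \quad & a = b \ \text{and}\ c = d \implies a + c = b + d \\
\text{C.N. 3:} \quad & a = b \ \text{and}\ c = d \implies a - c = b - d \\
\text{C.N. 4:} \quad & X \cong Y \implies m(X) = m(Y) \\
\text{C.N. 5:} \quad & X \subsetneq Y \implies m(X) < m(Y)
\end{align*}
```

Written this way, the first three notions read as rules for handling equality, whereas the last two tie equality and inequality to the figures themselves.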

We  observe  that  the  first  principles  of  Euclid’s  Elements  fit  quite  well 
the Aristotelian account of definitions, postulates, and axioms as given in
Section  2.2.  It  would  also  seem  that  Euclid  strove  to  keep  his  list  of  postulates 
and  axioms  to  an  irreducible  minimum.  This  economy,  too,  is  in  keeping 
with Aristotle’s views, for in his Analytica posteriora he says, “other things
being  equal,  that  proof  is  the  better  which  proceeds  from  the  fewer  postulates, 
or  hypotheses,  or  propositions.” 

We  shall  pass  over  Euclid’s  definitions  without  much  comment.  Most 
of  them  probably  were  taken  from  earlier  works,  which  would  account  for 
the  fact  that  some  terms,  like  oblong,  rhombus,  and  rhomboid,  are  included  but
are  never  used  anywhere  in  the  work.  It  is  curious  that  after  having  defined
parallel  lines  Euclid  does  not  give  a  formal  definition  of  parallelogram.  The
existence  of  a  parallelogram  is  established  in  I  33,*  and  in  I  34  it  is  referred
to  as  a  parallelogrammic  area;  then  in  I  35  this  latter  expression  is  shortened  to
parallelogram.  We  note  that  to  the  definition  of  a  diameter  of  a  circle  (Definition
17)  is  appended  the  statement,  “Such  a  straight  line  also  bisects  the
circle.”  This  addition  is,  of  course,  really  a  theorem  (one  of  those  attributed 
by  Proclus  in  the  Eudemian  Summary  to  Thales),  but  its  statement  in  Definition 
17  is  necessary  in  order  to  justify  the  definition  of  a  semicircle  which
immediately  follows.  There  are  indications  for  believing  that  the  definitions  of  a
straight  line  and  of  a  plane  (Definitions  4  and  7)  were  original  with  Euclid. 
These  definitions  are  not  easy  to  understand,  but  can  be  comprehended,  at 
least  partially,  if  we  appeal  to  sight  by  considering  an  eye  placed  at  an 
extremity  of  the  line  or  the  plane  and  looking,  respectively,  along  the  line 
or  the  plane.  Other  interpretations  of  these  definitions  have  been  given. 

*I  33  means  Proposition  33  of  Book  I.
A  number  of  Euclid’s  definitions  are  vague  and  virtually  meaningless;  we
shall  return  to  this  in  the  next  section.  The  work  of  Heath  previously  referred 
to  contains  a  full  and  valuable  commentary  on  Euclid’s  definitions. 

Some  aspects  of  Euclid’s  postulates  are  of  especial  interest.  The  first 
three  are  postulates  of  construction,  for  they  assert  what  we  are  permitted 
to  draw.  Since  these  postulates  restrict  constructions  to  only  those  that  can 
be  made  in  a  permissible  manner  with  straightedge  and  compasses,  these 
instruments,  so  limited,  have  become  known  as  Euclidean  tools,  although
their  use  under  these  restrictions  certainly  predates  Euclid.  The  construction 
of  figures  with  only  straightedge  and  compasses,  viewed  as  a  game  played 
according  to  the  rules  set  down  in  Euclid’s  first  three  postulates,  has  proved 
to  be  one  of  the  most  fascinating  and  absorbing  games  ever  devised.  One 
is  surprised  at  the  really  intricate  constructions  that  can  be  accomplished 
in  the  allowed  manner,  and  accordingly  it  is  hard  to  believe  that  certain 
seemingly  simple  construction  problems,  like  that  of  trisecting  a  given 
arbitrary  angle,  for  example,  cannot  also  be  so  accomplished.  The  energetic 
efforts  of  early  Greek  geometers  to  solve  legitimately  some  of  the  construction 
problems  which  are  now  known  to  be  beyond  the  use  of  Euclidean  tools 
profoundly  influenced  the  development  of  much  of  the  content  of  early 
geometry.  For  example,  the  invention  of  the  conic  sections,  of  many  cubic 
and  quartic  curves,  and  of  several  transcendental  curves  resulted  from  this 
work.  A  later  outgrowth  was  the  development,  in  modern  times,  of  portions 
of  the  theory  of  equations,  of  the  theory  of  algebraic  numbers,  and  of  group 
theory.  This  whole  line  of  mathematical  development,  so  intimately  tied  to 
Euclid’s  first  three  postulates,  has  little  connection  with  our  present  line  of
investigation,  and  so  will  not  be  further  considered  here.  We  shall  return
to  the  subject,  however,  in  Section  A.3  of  the  Appendix.

Postulates  1  and  3  refer  to  existence.  In  other  words,  the  existence  of  a 
straight  line  joining  any  two  given  points  is  assumed,  as  is  the  existence  of  a 
circle  having  any  given  center  and  radius.  From  applications  that  Euclid 
makes  of  Postulates  1  and  2,  it  appears  that  these  postulates  are  meant  also 
to  imply  that  the  straight  line  segment  joining  two  points  in  the  one  case, 
and  the  produced  portion  in  the  other  case,  are  unique,  although  it  must  be
admitted  that  the  postulates  do  not  explicitly  say  as  much.  Postulate  3  may 
be  construed  as  implying  something  in  regard  to  the  continuity  and  extent 
of  the  space  under  consideration,  since  the  radius  of  the  circle  may  be  as 
small  or  as  large  as  one  desires. 

Postulates  4  and  5  are  quite  different  from  the  first  three  postulates. 
The  meaning  of  Postulate  4  is  certainly  evident,  but  there  has  been  much 
debate  on  whether  it  is  properly  classified  when  placed  among  the  postulates. 
If  it  should  be  classified  as  a  theorem  its  proof  would  have  to  be  accomplished 
by  applying  one  pair  of  adjacent  right  angles  to  another  such  pair,  but 
Euclid  preferred  to  shun,  as  much  as  possible,  such  proofs  by  superposition. 
In  any  event,  Euclid  had  to  place  Postulate  4  before  his  Postulate  5,  since  the
condition  in  Postulate  5  that  a  certain  pair  of  interior  angles  be  together 
less  than  two  right  angles  would  be  useless  unless  it  were  first  made  clear 
that  all  right  angles  are  equal. 

Postulate  5,  known  as  Euclid’s  parallel  postulate,  has  become,  as  we  shall
see,  one  of  the  most  famous  statements  in  mathematical  history.  There  is
more  evidence  for  the  origin  of  this  postulate  with  Euclid  than  for  the  origin
of  any  of  the  other  four,  because  Aristotle  alludes  to  a  petitio  principii,  or  a
circularity  in  reasoning,  that  was  involved  in  the  theory  of  parallels  current 
in  his  time.  It  is  a  mark  of  Euclid’s  mathematical  acumen  that  he  perceived 
that  the  only  way  out  of  the  difficulty  was  to  lay  down  some  postulate  as  a 
basis  for  the  theory  of  parallels  that  is  so  essential  to  the  development  of  his 
geometry.  The  postulate  that  he  formulated  serves  this  purpose  admirably 
and  also,  at  the  same  time,  furnishes  a  criterion  for  determining  whether  two 
straight  lines  in  a  figure  will  or  will  not  meet  if  extended.  This  fact  is  an 
advantage  of  Euclid’s  postulate  over  the  substitutes  that  were  later  suggested 
to  take  its  place,  and  this  advantage  is  actually  employed  in  the  Elements 
as  early  as  I  44.  The  consequences  of  investigations  carried  on  in  connection 
with  Euclid’s  fifth  postulate  proved  to  be  very  far-reaching.  Not  only  did 
these  investigations  supply  the  stimulus  for  the  development  of  much  of  the 
mathematics  which  we  characterize  as  modern,  but  they  led  to  a  far  deeper 
examination,  and  consequent  perfection,  of  the  axiomatic  method.  These 
investigations  are  therefore  vital  to  our  present  study  and  will  constitute  the 
dramatic  story  of  the  next  chapter. 
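
Read as a criterion, Postulate 5 can be sketched in a few lines of code. The sketch below is only an illustration under stated assumptions: the function names meeting_side and distance_to_meeting are hypothetical, angles are measured in degrees, and the distance formula uses the law of sines, a piece of later trigonometry that is not part of Euclid's first principles. Here alpha and beta are the two interior angles that a transversal of length d makes, on one chosen side, with the two lines.

```python
import math


def meeting_side(alpha, beta):
    """Classify two lines by the same-side interior angles alpha, beta (degrees)."""
    total = alpha + beta
    if total < 180:
        return "this side"    # the case Postulate 5 speaks of directly
    if total > 180:
        return "other side"   # the angles on the other side then sum to less than 180
    return "no meeting"       # the lines are parallel (Proposition I 28)


def distance_to_meeting(alpha, beta, d=1.0):
    """Distance from the foot of the transversal at angle alpha to the meeting point."""
    if meeting_side(alpha, beta) != "this side":
        raise ValueError("the lines do not meet on this side")
    # Law of sines in the triangle cut off by the transversal of length d;
    # the angle at the meeting point is 180 - alpha - beta.
    return d * math.sin(math.radians(beta)) / math.sin(math.radians(alpha + beta))


print(meeting_side(60, 60))     # this side
print(meeting_side(90, 90))     # no meeting
print(meeting_side(120, 100))   # other side
```

With alpha = beta = 60 and d = 1 the computed distance is 1 up to rounding, so the two lines close up an equilateral triangle on the transversal.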

Of  the  common  notions,  or  axioms,  there  is  reason  to  believe  that  the 
first  three  were  given  by  Euclid,  but  that  the  last  two  may  have  been  added 
at  a  later  time.  Axiom  4  has  been  criticized  on  the  ground  that  its  subject
matter  is  special  rather  than  general,  and  that  it  ought  therefore  to  be  listed 
as  a  postulate  instead  of  as  an  axiom.  Objections  which  can  be  raised  to 
the  method  of  superposition,  used  by  Euclid  with  apparent  reluctance  to 
establish  some  of  his  early  congruence  theorems,  can  be  at  least  partially
met  by  Axiom  4.  Again  the  student  is  referred  to  the  excellent  commentary 
given  by  Heath. 

In  conclusion,  we  may  summarize  Euclid’s  conception  and  use  of  the 
axiomatic  method  somewhat  as  follows:  Every  deductive  system  requires 
assumptions  from  which  the  deduction  may  proceed.  Therefore,  as  initial 
premises,  Euclid  puts  down  five  postulates,  or  assumed  statements  about  his 
subject  matter.  In  addition  to  the  five  postulates,  Euclid  lists  five  axioms, 
or  common  notions,  which  he  also  needs  for  his  proofs.  These  axioms  are 
not  peculiar  to  his  subject  matter  but  are  general  principles  valid  in  any 
field  of  study.  Now  in  the  postulates  a  number  of  terms  occur,  such  as  point,
straight  line,  right  angle,  and  circle,  of  which  it  is  not  certain  that  the  reader
has  a  precise  notion.  Hence  some  definitions  are  also  given.  These  definitions
are  not,  like  the  postulates,  assumptions  about  the  nature  of  the  subject
matter  but  are  merely  explanations  of  the  meanings  of  the  terms.  Definition 
10,  for  example,  tells  what  a  right  angle  is,  and  how  an  angle  may  be 
identified  as  a  right  angle,  but  it  says  nothing  about  the  existence  of  right 
angles,  nor  does  it  state  what  is  assumed  about  such  angles.  These  latter 
functions  are  left  to  the  postulates  and  to  deduced  propositions.  Thus  Postulate
4  informs  us  that  all  right  angles  are  equal,  and  Proposition  I  11  proves
that  right  angles  exist.  On  the  other  hand,  Postulate  4  gives  no  clue  regarding 
the  nature  of  a  right  angle,  nor  does  it  tell  how  the  term  is  to  be  employed; 
it  merely  states  a  fundamental  assumption  about  such  angles.  Finally,  the 
natural  order  for  presenting  the  postulates,  axioms,  and  definitions  to  the 
student  is,  first,  the  definitions  explaining  the  meanings  of  the  technical  terms 
of  the  discourse,  next,  the  postulates  which  are  so  closely  related  to  the 
definitions,  and  lastly,  the  axioms  or  common  notions. 

2.4  Some  Logical  Shortcomings  of  Euclid's  Elements

It  would  be  very  surprising  indeed  if  Euclid’s  Elements,  because  it  is
such  an  early  and  extensive  application  of  the  axiomatic  method,  should  be 
free  of  logical  blemishes.  Therefore  it  is  no  great  discredit  to  the  work  that 
critical  investigations  have  revealed  a  number  of  defects  in  its  logical  structure.
Probably  the  gravest  of  these  defects  are  certain  tacit  assumptions  that
are  employed  later  in  the  deductions  and  are  not  granted  by  the  first  principles
of  the  work.  This  danger  exists  in  any  deductive  study  when  the  subject
matter  is  overly  familiar  to  the  author.  Usually  a  thorough  grasp  of  the 
subject  matter  in  a  field  of  human  endeavor  is  regarded  as  an  indispensable 
prerequisite  to  serious  work,  but  in  developing  a  deductive  system  such 
knowledge  can  be  a  definite  disadvantage  unless  proper  precautions  are 
taken. 

A  deductive  system  differs  from  a  mere  collection  of  statements  in  that 
it  is  organized  in  a  very  special  way.  The  key  to  the  organization  lies  in  the 
fact  that  all  statements  of  the  system  other  than  the  original  assumptions 
must  be  deducible  from  these  initial  hypotheses,  and  that  if  any  additional 
assumptions  should  creep  into  the  work  the  desired  organization  is  not 
realized.  Now  anyone  formulating  a  deductive  system  knows  more  about 
his  subject  matter  than  just  the  initial  assumptions  he  wishes  to  employ. 
He  has  before  him  a  set  of  statements  belonging  to  his  subject  matter,  some  of 
which  he  selects  for  postulates  and  the  rest  of  which  he  presumably  deduces 
from  his  postulates  as  theorems.  But,  with  a  large  body  of  information  before 
one,  it  is  very  easy  to  employ  in  the  proofs  some  piece  of  this  information  which 
is  not  embodied  in  the  postulates.  Any  piece  of  information  used  in  this  way 
may  be  so  apparently  obvious  or  so  seemingly  elementary  that  it  is  assumed 
unconsciously.  Such  a  tacit  assumption,  of  course,  spoils  the  rigidity  of  the
organization  of  the  deductive  system.  Moreover,  should  that  piece  of  information
involve  some  misconception,  its  introduction  may  lead  to  results
which  not  only  do  not  strictly  follow  from  the  postulates  but  which  may 
actually  contradict  some  previously  established  theorem.  Herein,  then,  lies 
the  pitfall  of  too  great  a  familiarity  with  the  subject  matter  of  the  discourse; 
at  all  times  in  building  up  a  deductive  system  one  must  proceed  with  the
appearance  of  being  completely  ignorant  of  the  developing  material.  This 
does  not  mean  that  in  building  up  a  deductive  system  one  refrains  from 
making  any  use  of  one’s  intuitive  appreciation  of  the  significance  of  the 
axioms  and  of  possible  interpretations  of  the  primitive  terms.  On  the  contrary 
one  makes  full  use  of  these  things,  but  only  to  conjecture  possible  theorems 
and  possible  avenues  of  investigation.  In  the  actual  establishment  of  these 
theorems  and  in  the  actual  development  of  these  avenues  of  investigation, 
one  must  be  careful  to  proceed  only  in  terms  of  the  accepted  assumptions. 

The  tacit  assumption  by  Euclid  of  something  that  is  not  contained  in 
his  first  principles  is  exemplified  in  the  very  first  deduced  proposition  of  the 
Elements.  In  order  to  examine  the  difficulty  we  shall  quote  Proposition  I  1 
verbatim  from  Heath’s  translation.* 


On  a  given  finite  straight  line  to  construct  an  equilateral  triangle.  (See  Fig.  9.) 
Let  AB  be  the  given  finite  straight  line. 

Thus  it  is  required  to  construct  an  equilateral  triangle  on  the  straight 
line  AB. 

With  center  A  and  distance  AB,  let  the  circle  BCD  be  described.  [Postulate  3]
Again,  with  center  B  and  distance  BA,  let  the  circle  ACE  be  described. 
[Postulate  3] 

And  from  the  point  C,  in  which  the  circles  cut  one  another,  to  the  points
A,  B,  let  the  straight  lines  CA,  CB  be  joined.  [Postulate  1]

*T.  L.  Heath  [5],  241,  242.

Now,  since  the  point  A  is  the  center  of  the  circle  CDB,  AC  is  equal  to  AB. 
[Definition  15] 

Again,  since  the  point  B  is  the  center  of  the  circle  CAE,  BC  is  equal  to  BA.
[Definition  15] 

But  CA  was  also  proved  equal  to  AB;  therefore  each  of  the  straight  lines  CA, 
CB  is  equal  to  AB.  And  things  which  are  equal  to  the  same  thing  are  also  equal 
to  one  another;  therefore  CA  is  also  equal  to  CB.  [Axiom  1] 

Therefore  the  three  straight  lines  CA,  AB,  BC  are  equal  to  one  another. 

Therefore  the  triangle  ABC  is  equilateral;  and  it  has  been  constructed  on  the 
given  finite  straight  line  AB. 

(Being)  what  it  was  required  to  do. 

Now  the  construction  of  the  two  circles  in  this  demonstration  is  certainly 
justified  by  Postulate  3,  but  there  is  nothing  in  Euclid’s  first  principles  which 
explicitly  guarantees  that  the  two  circles  shall  intersect  in  a  point  C,  and
that  they  will  not,  somehow  or  other,  slip  through  each  other  with  no  common 
point.  The  existence  of  this  point,  then,  must  be  either  postulated  or  proved, 
and  it  can  be  shown  that  Euclid’s  postulates  are  insufficient  to  permit  the 
latter  (see  Problem  2.4.3).  Only  by  the  introduction  of  some  additional 
assumption  can  the  existence  of  the  point  C  be  established.  Therefore  the 
proposition  does  not  follow  from  Euclid’s  first  principles,  and  the  proof  of 
the  proposition  is  invalid. 
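
One familiar way to see that the point C cannot be taken for granted is to read "point" as "point with rational coordinates". This rational-plane reading is an illustration chosen here, not necessarily the construction intended in Problem 2.4.3, and the helper name rational_sqrt is hypothetical. Take A = (0, 0) and B = (1, 0). Subtracting the circle equation x² + y² = 1 from (x − 1)² + y² = 1 forces x = 1/2, and then y² = 3/4. The sketch below merely checks whether 3/4 has a rational square root.

```python
from fractions import Fraction
from math import isqrt


def rational_sqrt(q):
    """Return the non-negative rational square root of q, or None if there is none."""
    if q < 0:
        return None
    # A fraction in lowest terms is the square of a rational exactly when
    # its numerator and denominator are both perfect squares.
    n, d = q.numerator, q.denominator
    rn, rd = isqrt(n), isqrt(d)
    if rn * rn == n and rd * rd == d:
        return Fraction(rn, rd)
    return None


# Circles of radius 1 about A = (0, 0) and B = (1, 0).
x = Fraction(1, 2)       # forced by subtracting the two circle equations
y_squared = 1 - x * x    # 3/4

print(rational_sqrt(y_squared))         # None: no rational point C exists
print(rational_sqrt(Fraction(9, 16)))   # 3/4: the helper does find rational roots
```

In such a plane both circles can be described, yet they pass through each other without a common point, which is exactly the possibility that Euclid's first principles fail to exclude.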

The  fallacy  here  lies,  not  in  assuming  something  contrary  to  our  concept 
of  circles,  but  in  assuming  something  which  is  not  implied  by  our  accepted 
first  principles.  This  is  an  example  where  the  tacit  assumption  is  so  evident 
and  elementary  that  there  does  not  appear  to  be  any  assumption.  The  fallacy 
is  a  subtle  one,  but  had  Euclid  known  nothing  more  about  circles  than  what 
his  first  principles  say  of  them  he  certainly  could  not  have  fallen  into  this 
error. 

What  is  needed  here  is  some  additional  postulate  which  will  guarantee 
that  the  two  circles  concerned  will  intersect.  Postulate  5  gives  a  condition 
under  which  two  straight  lines  will  intersect.  We  need  similar  postulates 
telling  when  two  circles  will  intersect  and  when  a  circle  and  a  straight  line 
will  intersect.  What  is  essentially  involved  here  is  the  continuity  of  circles
and  straight  lines,  and  in  modern  treatments  of  geometry  the  existence  of 
the  desired  points  of  intersection  is  taken  care  of  by  some  sort  of  continuity 
postulate. 

Another  tacit  assumption  made  by  Euclid  is  that  the  straight  line  is  of 
infinite  extent.  Although  Postulate  2  asserts  that  a  straight  line  may  be 
produced  indefinitely,  it  does  not  necessarily  imply  that  a  straight  line  is 
infinite  in  extent  but  merely  that  it  is  endless,  or  boundless.  The  arc  of  a 
great  circle  joining  two  points  on  a  sphere  may  be  produced  indefinitely 
along  the  great  circle,  making  the  prolonged  arc  endless,  but  certainly  it 
is  not  infinite  in  extent.  Now  it  is  conceivable  that  a  straight  line  may  behave
similarly,  and  that  after  a  finite  prolongation  it,  too,  may  return  on  itself. 
It  was  the  great  German  mathematician  Bernhard  Riemann  (1826-1866) 
who,  in  his  famous  probationary  lecture,  Über  die  Hypothesen  welche  der
Geometrie  zu  Grunde  liegen,  of  1854,  distinguished  between  the  boundlessness
and  the  infinitude  of  straight  lines.  There  are  numerous  occasions  where
Euclid  unconsciously  assumes  the  infinitude  of  a  straight  line.  Let  us  briefly
consider,  for  example,  Proposition  I  16: 

In  any  triangle,  if  one  of  the  sides  be  produced,  the  exterior  angle  is  greater  than
either  of  the  interior  and  opposite  angles. 

A  précis  of  Euclid’s  proof  runs  as  follows.  Let  ABC  (Figure  10)  be  the
given  triangle,  with  BC  produced  to  D.  Let  E  be  the  midpoint  of  AC.  Draw
BE  and  extend  it  its  own  length  to  F.  Draw  CF.  Then  triangles  BEA  and
FEC  can  easily  be  shown  to  be  congruent,  whence  ∠FCE  =  ∠BAC.  But
∠ACD  >  ∠FCE,  whence  ∠ACD  >  ∠BAC.  By  producing  AC  to  G  we
may  similarly  show  that  ∠BCG,  which  is  equal  to  ∠ACD,  is  also  greater
than  ∠ABC.

FIG.  10

Now  if  a  straight  line  should  return  on  itself,  like  the  great  circle  arc 
considered  above,  BF  may  be  so  long  that  F  will  coincide  with  B  or  lie  on  the
segment  BE.  Should  this  be  the  case,  the  proof  would  certainly  fail.  The 
author  has  been  misled  by  his  visual  reference  to  the  figure  rather  than  to  the 
principles  that  should  be  the  basis  of  his  argument.  Clearly,  then,  to  make 
the  proof  universally  valid  we  must  either  prove  or  postulate  the  infinitude 
of  straight  lines. 
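
The failure of Proposition I 16 when lines return on themselves can be checked numerically on a sphere, with great-circle arcs playing the role of straight lines. The following is a minimal sketch under assumptions made only for illustration: a unit sphere, a particular triangle with A at the north pole and B, C on the equator 120 degrees apart, and a hypothetical helper angle_at that measures the angle between two arcs by means of their tangent directions.

```python
import math


def angle_at(vertex, p, q):
    """Angle in degrees at `vertex` between the great-circle arcs to p and q."""
    def tangent(target):
        # Project target onto the plane tangent to the unit sphere at vertex.
        dot = sum(v * t for v, t in zip(vertex, target))
        vec = [t - dot * v for v, t in zip(vertex, target)]
        norm = math.sqrt(sum(c * c for c in vec))
        return [c / norm for c in vec]

    u, w = tangent(p), tangent(q)
    cosine = sum(a * b for a, b in zip(u, w))
    return math.degrees(math.acos(max(-1.0, min(1.0, cosine))))


# A is the north pole; B and C lie on the equator, 120 degrees apart.
A = (0.0, 0.0, 1.0)
B = (1.0, 0.0, 0.0)
C = (math.cos(math.radians(120)), math.sin(math.radians(120)), 0.0)

interior_A = angle_at(A, B, C)
interior_C = angle_at(C, A, B)
exterior_C = 180.0 - interior_C   # side BC produced beyond C

print(round(interior_A), round(interior_C), round(exterior_C))   # 120 90 90
```

Here the exterior angle at C is 90 degrees while the interior and opposite angle at A is 120 degrees, so the conclusion of I 16 is false for this triangle even though every arc in it can be produced without end.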

One  can  point  out  many  other  tacit  assumptions  which,  like  the  above, 
were  unconsciously  made  by  Euclid  and  which  vitiate  the  true  deductive 
character  of  his  work.  For  example,  in  Proposition  I  21,  Euclid  unconsciously
assumes  that  if  a  straight  line  enters  a  triangle  at  a  vertex  it  must,  if  sufficiently 
produced,  intersect  the  opposite  side.  It  was  Moritz  Pasch  (1843-1930)  who 
recognized  the  necessity  of  a  postulate  to  take  care  of  this  situation.  Again, 
Euclid  makes  no  provision  for  linear  order,  and  his  concept  of  “betweenness”
is  without  any  postulational  foundation,  with  the  result  that  paradoxes  are 
possible.  We  have  already  pointed  out  that  Postulate  1,  which  guarantees 
the  existence  of  at  least  one  straight  line  joining  two  points  A  and  B,  probably 
was  meant  to  imply  uniqueness  of  this  line,  but  the  postulate  fails  to  assert 
so  much.  Also,  the  objections  that  can  be  raised  against  the  principle  of 
superposition,  still  employed  in  many  popular  textbooks,  are  only  partially 
met  by  Euclid’s  Axiom  4. 

In  short,  the  truth  of  the  matter  is  that  Euclid’s  first  principles  are 
simply  not  sufficient  for  the  derivation  of  all  of  the  465  propositions  of  the 
Elements.  In  particular,  the  set  of  postulates  needs  to  be  considerably  amplified.
The  work  of  perfecting  Euclid’s  initial  assumptions,  so  that  all  of  his
geometry  can  rigorously  follow,  occupied  mathematicians  for  more  than  two 
thousand  years.  Not  until  the  end  of  the  nineteenth  century  and  the  early 
part  of  the  twentieth  century,  after  the  foundations  of  geometry  had  been 
subjected  to  an  intensive  study,  were  satisfactory  sets  of  postulates  supplied 
for  Euclidean  plane  and  solid  geometry.  The  history  of  this  struggle  is  of 
major  concern  in  our  present  study,  and  in  following  it  we  shall  encounter 
the  device  that  mathematicians  contrived  for  avoiding  the  pitfall,  into  which 
Euclid  so  often  fell,  of  overfamiliarity  with  the  subject  matter. 

Not  only  is  Euclid’s  work  marred  by  numerous  tacit  assumptions,  but 
some  of  his  preliminary  definitions  are  also  open  to  criticism.  Euclid,  following
the  Greek  pattern  of  material  axiomatics,  makes  some  sort  of  attempt  to
define,  or  at  least  to  explain,  all  the  terms  of  his  discourse.  Now,  actually, 
it  is  as  impossible  to  define  explicitly  all  of  the  terms  of  a  discourse  as  it  is  to 
prove  all  of  the  statements  of  the  discourse,  for  a  term  must  be  defined  by 
means  of  other  terms,  and  these  other  terms  by  means  of  still  others,  and 
so  on.  In  order  to  get  started,  and  to  avoid  circularity  of  definition  where 
term  x  is  defined  by  means  of  term  y,  and  then  later  term  y  by  means  of 
term  x,  one  is  forced  to  set  down  at  the  very  start  of  the  discourse  a  collection 
of  primitive,  or  basic,  terms  whose  meanings  are  not  to  be  questioned.  All 
subsequent  terms  of  the  discourse  must  be  defined,  ultimately,  by  means  of 
these  initial  primitive  ones.  The  postulates  of  the  discourse  are,  then,  in  final 
analysis,  assumed  statements  about  the  primitive  terms.  From  this  point  of 
view,  the  primitive  terms  may  be  regarded  as  defined  implicitly,  in  the  sense
that  they  are  any  things  or  concepts  which  satisfy  the  postulates,  and  this 
implicit  definition  is  the  only  kind  of  definition  that  the  primitive  terms 
can  receive. 
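
The argument about primitive terms can be pictured as a small bookkeeping exercise. The sketch below is illustrative only: the toy vocabularies and the function names primitive_terms and find_circular are hypothetical and are not drawn from Euclid's list of definitions. Each defined term is mapped to the set of terms used in its definition; the terms that are used but never defined are the primitives, and a term that ultimately depends on itself signals a circular definition.

```python
def primitive_terms(definitions):
    """Terms that are used in some definition but never defined themselves."""
    used = set().union(*definitions.values()) if definitions else set()
    return used - set(definitions)


def find_circular(definitions):
    """Return one defined term that ultimately depends on itself, or None."""
    def reaches(start, term, seen):
        for dep in definitions.get(term, ()):
            if dep == start:
                return True
            if dep not in seen:
                seen.add(dep)
                if reaches(start, dep, seen):
                    return True
        return False

    for term in definitions:
        if reaches(term, term, set()):
            return term
    return None


# Hypothetical toy vocabularies, not Euclid's own definitions.
sound = {
    "segment": {"point", "line"},
    "triangle": {"segment", "point"},
}
circular = {
    "x": {"y"},
    "y": {"x"},
}

print(sorted(primitive_terms(sound)))   # ['line', 'point']
print(find_circular(sound))             # None
print(find_circular(circular))          # x
```

The circular vocabulary has no primitive terms at all, which is why a discourse organized that way never gets started.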

In  Euclid’s  development  of  geometry  the  terms  point  and  line,  for 
example,  could  well  have  been  included  in  a  set  of  primitive  terms  for  the
discourse.  At  any  rate,  Euclid’s  definitions  of  a  point  as  “that  which  has  no
part”  and  of  a  line  as  “length  without  breadth”  are  easily  seen  to  be  circular
and  therefore,  from  a  logical  viewpoint,  woefully  inadequate.  One  distinction
between  the  Greek  conception  and  the  modern  conception  of  the
axiomatic  method  lies  in  this  matter  of  primitive  terms;  in  the  Greek  conception
there  is  no  listing  of  the  primitive  terms.  The  excuse  for  the  Greeks
is  that  to  them  geometry  was  not  just  an  abstract  study,  but  was  an  attempted 
logical  analysis  of  idealized  physical  space.  Points  and  lines  were,  to  the 
Greeks,  idealizations  of  very  small  particles  and  of  very  thin  threads.  It  is 
this  idealization  that  Euclid  attempts  to  express  in  his  two  initial  definitions. 

Other  differences  between  the  Greek  and  the  modern  views  of  the 
axiomatic  method  will  be  discussed  in  a  later  chapter. 

2.5  The  End  of  the  Greek  Period  and  the  Transition  to  Modern  Times*

Very  little  in  the  further  development  of  the  axiomatic  method  took 
place  after  Euclid  until  relatively  modern  times.  We  must  mention,  however, 
the  brilliant  exploitation  of  the  method  by  Archimedes  (287-212  b.c.),  one 
of  the  greatest  mathematicians  of  all  time,  and  certainly  the  greatest  of 
antiquity.  Although  Archimedes  lived  most  of  his  long  life  in  the  Greek 
city  of  Syracuse,  on  the  island  of  Sicily,  it  seems  that  he  studied  for  a  time  at 
the  University  of  Alexandria.  He  was  thoroughly  schooled  in  the  Euclidean 
tradition,  and  he  left  deep  imprints  on  both  geometry  and  mechanics. 
Archimedes’  works  are  masterpieces  of  mathematical  exposition  and  resemble 
to  a  remarkable  extent,  because  of  their  high  finish,  economy  of  presentation, 
and  rigor  in  demonstration,  the  articles  found  in  present-day  research 
journals.  It  is  interesting  that  Archimedes  employed  the  axiomatic  method 
in  his  writings  on  theoretical  mechanics,  as  well  as  in  his  purely  geometrical 
studies,  always  laying  down  the  first  principles  of  the  work  and  then  deducing 
a  sequence  of  propositions.  Thus,  in  his  treatise  On  Plane  Equilibriums,
Archimedes  establishes  25  theorems  of  mechanics  on  the  basis  of  three  simple 
postulates  suggested  by  common  experience.  The  postulates  are  as  follows: 
(1)  Equal  weights  at  equal  distances  balance;  equal  weights  at  unequal 
distances  do  not  balance  but  incline  toward  the  weight  which  is  at  the  greater 
distance.  (2)  If,  when  weights  at  certain  distances  balance,  something  be 
added  to  one  of  the  weights,  equilibrium  will  not  be  maintained,  but  there 
will  be  inclination  on  the  side  of  the  weight  to  which  the  addition  was  made; 
similarly,  if  anything  be  taken  away  from  one  of  the  weights,  there  will  be 

*This  section  is  largely  skimmed  from  the  appropriate  places  in  H.  Eves. 
inclination  on  the  side  of  that  weight  from  which  nothing  was  taken.
(3)  When  equal  and  similar  plane  figures  coincide  if  placed  on  one  another, 
their  centroids  similarly  coincide;  and  in  figures  which  are  unequal  but 
similar  the  centroids  will  be  similarly  situated.  From  these  simple  postulates 
Archimedes  locates,  for  example,  the  centroid  of  any  parabolic  segment  and 
of  any  portion  of  a  parabola  lying  between  two  parallel  chords.  Problems 
of  this  sort  would  today  be  worked  out  by  means  of  the  integral  calculus. 

Again,  in  his  work  On  Floating  Bodies,  Archimedes  rests  the  establishment
of  the  19  propositions  of  the  work  on  two  fundamental  postulates.  This 
treatise  is  the  first  recorded  application  of  mathematics  to  hydrostatics,  and 
it  begins  by  developing  those  familiar  laws  of  hydrostatics  which  nowadays 
are  encountered  in  an  elementary  physics  course.  The  treatise  then  goes  on 
to  consider  several  rather  difficult  problems,  culminating  with  a  remarkable 
investigation  of  the  positions  of  rest  and  of  stability  of  a  right  segment  of  a 
paraboloid  of  revolution  floating  in  a  fluid.  Not  until  the  sixteenth-century 
work  of  Simon  Stevin  did  the  science  of  statics  and  the  theory  of  hydrodynamics
appreciably  advance  beyond  the  points  reached  by  Archimedes.
It  is  worthy  of  note  that  these  early  researches  in  theoretical  physics  were 
developed  by  the  use  of  the  axiomatic  method. 

There  is  a  geometrical  assumption  explicitly  stated  by  Archimedes  in 
his  work  On  the  Sphere  and  Cylinder  which  deserves  special  mention;  it  is  one 
of  the  five  postulates  assumed  at  the  start  of  Book  I  of  the  work  and  it  has 
become  known  as  the  postulate  of  Archimedes.  A  simple  statement  of  the  postulate
is  as  follows:  Given  two  unequal  linear  segments,  there  is  always  some  finite
multiple  of  the  shorter  one  which  is  longer  than  the  other.  In  some  modern  treatments
of  geometry  this  postulate  serves  as  part  of  the  postulational  basis  for  introducing
the  concept  of  continuity.  It  is  a  matter  of  interest  that  in  the
nineteenth  and  twentieth  centuries  geometric  systems  were  constructed  which 
denied  the  Archimedean  postulate,  thus  giving  rise  to  so-called  non-Archimedean
geometries.  Although  named  after  Archimedes,  this  postulate  had
been  considered  earlier  by  Eudoxus. 
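
In symbols, if the two segments have lengths a and b with 0 < a < b, the postulate of Archimedes asserts that there is a positive integer n with na > b. The sketch below assumes, purely for illustration, that lengths are modeled as exact rational numbers; the function name archimedean_multiple is hypothetical. Under that assumption the smallest such n can be computed directly.

```python
from fractions import Fraction


def archimedean_multiple(shorter, longer):
    """Smallest positive integer n with n * shorter > longer."""
    if not 0 < shorter < longer:
        raise ValueError("expected 0 < shorter < longer")
    # Floor division of Fractions yields the integer k with
    # k * shorter <= longer < (k + 1) * shorter.
    return longer // shorter + 1


n = archimedean_multiple(Fraction(1, 3), Fraction(5))
print(n)   # 16, since 15 * (1/3) = 5 is not yet longer than 5
```

A non-Archimedean geometry is one in which no such computation can succeed for certain pairs of segments: however many copies of the shorter segment are laid end to end, the longer one is never exceeded.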

There  were  other  able  Greek  mathematicians  in  ancient  times  after 
Euclid  besides  Archimedes— for  example,  Apollonius,  Eratosthenes, 
Menelaus,  Claudius  Ptolemy,  Heron,  Diophantus,  and  Pappus— but  these 
men  did  little  to  advance  the  development  of  the  axiomatic  method  and  so 
have  slight  connection  with  our  present  study.  After  Pappus,  who  flourished 
toward  the  end  of  the  third  century  a.d.,  Greek  mathematics  practically 
ceased  as  a  living  study,  and  thenceforth  merely  its  memory  was  perpetuated 
by  minor  writers  and  commentators,  such  as  Theon  and  Proclus.  This  closing 
period  of  ancient  times  was  dominated  by  Rome.  One  Greek  center  after 
another  had  fallen  before  the  power  of  the  Roman  armies;  in  146  b.c.  Greece 
had  become  a  province  of  the  Roman  Empire,  although  Mesopotamia  was 
not  conquered  until  65  b.c.,  and  Egypt  held  out  until  30  b.c.  The  economic 
structure  of  the  empire  was  based  essentially  on  agriculture  and  an  increasing
use  of  slave  labor.  Conditions  proved  more  and  more  stifling  to  original 
scientific  work,  and  a  gradual  decline  in  creative  thinking  set  in.  The 
eventual  collapse  of  the  slave  market,  with  its  disastrous  effect  on  Roman 
economy,  found  science  reduced  to  a  mediocre  level.  The  famous  Alexandrian 
school  gradually  faded  with  the  breakup  of  ancient  society,  and  finally,  in 
641  a.d.,  Alexandria  was  taken  by  the  Arabs,  who  put  the  torch  to  what  the 
Christians  had  left.  The  long  and  glorious  era  of  Greek  mathematics  was  over. 

The  period  starting  with  the  fall  of  the  Roman  Empire  in  the  middle 
of  the  fifth  century  and  extending  into  the  eleventh  century  is  known  as 
Europe’s  Dark  Ages,  for  during  this  period  civilization  in  western  Europe 
reached a very low ebb. Schooling became almost nonexistent, Greek learning
all but disappeared, and many of the arts and crafts bequeathed by the
ancient  world  were  forgotten.  Only  the  monks  of  the  Catholic  monasteries, 
and  a  few  cultured  laymen,  preserved  a  slender  thread  of  Greek  and  Latin 
learning.  The  period  was  marked  by  great  physical  violence  and  intense 
religious  faith.  The  old  social  order  gave  way,  and  society  became  feudal 
and  ecclesiastical. 

The Romans had never taken to abstract mathematics but had contented
themselves with merely a few practical aspects of the subject that were
associated  with  commerce  and  civil  engineering.  With  the  fall  of  the  Roman 
Empire  and  the  subsequent  closing  of  much  of  east-west  trade  and  the 
abandonment  of  state  engineering  projects,  even  these  interests  waned,  and  it 
is no exaggeration to say that very little in mathematics, beyond the development
of the Christian calendar, was accomplished in the West during the
whole  of  the  half  millennium  covered  by  the  Dark  Ages. 

During  this  bleak  period  of  learning  the  people  of  the  east,  especially 
the  Hindus  and  the  Arabs,  became  the  major  custodians  of  mathematics. 
However,  the  Greek  concept  of  rigorous  thinking— in  fact,  the  very  idea  of 
proof— seemed  distasteful  to  the  Hindu  way  of  doing  things.  Although  the 
Hindus  excelled  in  computation,  contributed  to  the  devices  of  algebra,  and 
played  an  important  role  in  developing  our  present  positional  numeral 
system,  they  produced  nothing  of  importance  so  far  as  basic  methodology 
is  concerned.  Hindu  mathematics  of  this  period  is  largely  empirical  and 
lacks  those  outstanding  Greek  characteristics  of  clarity  and  logicality  in 
presentation  and  of  insistence  on  rigorous  demonstration. 

The  spectacular  episode  of  the  rise  and  decline  of  the  Arabian  empire 
occurred  during  the  period  of  Europe’s  Dark  Ages.  Within  a  decade  following 
Mohammed’s  flight  from  Mecca  to  Medina  in  622  a.d.,  the  scattered  and 
disunited  tribes  of  the  Arabian  peninsula  were  consolidated  by  a  strong 
religious  fervor  into  a  powerful  nation.  Within  a  century,  force  of  arms  had 
extended  the  Moslem  rule  and  influence  over  a  territory  reaching  from  India, 
through  Persia,  Mesopotamia,  and  northern  Africa,  clear  into  Spain.  Of 
considerable importance for the preservation of much of world culture was
the  manner  in  which  the  Arabs  seized  upon  Greek  and  Hindu  erudition. 
The  Bagdad  caliphs  not  only  governed  wisely  and  well  but  many  became 
patrons  of  learning  and  invited  distinguished  scholars  to  their  courts. 
Numerous  Hindu  and  Greek  works  in  astronomy,  medicine,  and  mathematics 
were  industriously  translated  into  the  Arabic  tongue  and  thus  were  saved 
until  later  European  scholars  were  able  to  retranslate  them  into  Latin  and 
other  languages.  But  for  the  work  of  the  Arabian  scholars  a  great  part  of 
Greek  and  Hindu  science  would  have  been  irretrievably  lost  over  the  long 
period  of  the  Dark  Ages. 

Not  until  the  latter  part  of  the  eleventh  century  did  Greek  classics  in 
science  and  mathematics  begin  once  again  to  filter  into  Europe.  There 
followed a period of transmission during which the ancient learning preserved
by Moslem culture was passed on to the western Europeans through
Latin  translations  made  by  Christian  scholars  traveling  to  Moslem  centers 
of  learning,  and  through  the  opening  of  western  European  commercial 
relations  with  the  Levant  and  the  Arabian  world.  The  loss  of  Toledo  by 
the  Moors  to  the  Christians  in  1085  was  followed  by  an  influx  of  Christian 
scholars  to  that  city  to  acquire  Moslem  learning.  Other  Moorish  centers  in 
Spain  were  infiltrated,  and  the  twelfth  century  became,  in  the  history  of 
mathematics,  a  century  of  translators.  One  of  the  most  industrious  translators 
of  the  period  was  Gherardo  of  Cremona,  who  translated  into  Latin  more 
than  90  Arabian  works,  among  which  were  Ptolemy’s  Almagest  and  Euclid’s 
Elements.  At  the  same  time  Italian  merchants  came  in  close  contact  with 
eastern  civilization,  thereby  picking  up  useful  arithmetical  and  algebraical 
information.  These  merchants  played  an  important  part  in  the  European 
dissemination  of  the  Hindu-Arabic  system  of  numeration. 

The  thirteenth  century  saw  the  rise  of  the  universities  at  Paris,  Oxford, 
Cambridge,  Padua,  and  Naples.  Universities  were  to  become  potent  factors 
in  the  development  of  mathematics,  since  many  mathematicians  associated 
themselves  with  one  or  more  such  institutions.  During  this  century  Campanus 
made  a  Latin  translation  of  Euclid’s  Elements  which  later,  in  1482,  became 
the  first  printed  version  of  Euclid’s  great  work. 

The  fourteenth  century  was  a  mathematically  barren  one.  It  was  the 
century  of  the  Black  Death,  which  swept  away  more  than  a  third  of  the 
population  of  Europe,  and  during  this  century  the  Hundred  Years  War, 
with  its  political  and  economic  upheavals  in  northern  Europe,  got  well 
under  way. 

The fifteenth century witnessed the beginning of the European Renaissance
in art and learning. With the collapse of the Byzantine Empire,
culminating  in  the  fall  of  Constantinople  to  the  Turks  in  1453,  refugees 
flowed  into  Italy,  bringing  with  them  treasures  of  Greek  civilization.  Many 
Greek  classics,  up  to  that  time  known  only  through  the  often  inadequate 
Arabic translations, could now be studied from original sources. Also, about
the middle of the century, occurred the invention of printing, which revolutionized
the book trade and enabled knowledge to be disseminated at an
unprecedented  rate.  Mathematical  activity  in  this  century  was  largely 
centered  in  the  Italian  cities  and  in  the  central  European  cities  of  Nuremberg, 
Vienna, and Prague, and it concentrated on arithmetic, algebra, and trigonometry,
under the practical influence of trade, navigation, astronomy, and
surveying.

In  the  sixteenth  century  the  development  of  arithmetic  and  algebra 
continued,  the  most  spectacular  mathematical  achievement  of  the  century 
being  the  discovery,  by  Italian  mathematicians,  of  the  algebraic  solution  of 
cubic  and  quartic  equations.  In  1572  Commandino  made  a  very  important 
Latin  translation  of  Euclid’s  Elements  from  the  Greek.  This  translation  served 
as  a  basis  for  many  subsequent  translations,  including  a  very  influential 
work  by  Robert  Simson,  from  which,  in  turn,  so  many  English  editions 
were  derived. 

The  seventeenth  century  proved  to  be  particularly  outstanding  in  the 
history  of  mathematics.  Early  in  the  century  Napier  revealed  his  invention 
of logarithms, Harriot and Oughtred contributed to the notation and codification
of algebra, Galileo founded the science of dynamics, and Kepler
announced  his  laws  of  planetary  motion.  Later  in  the  century  Desargues  and 
Pascal  opened  a  new  field  of  pure  geometry,  Descartes  launched  modern 
analytic  geometry,  Fermat  laid  the  foundations  of  modern  number  theory, 
and  Huygens  made  distinguished  contributions  to  the  theory  of  probability 
and other fields. Then, toward the end of the century, after many mathematicians
had prepared the way, the epoch-making creation of the calculus
was  made  by  Newton  and  Leibniz.  Thus,  during  the  seventeenth  century, 
many  new  and  vast  fields  were  opened  for  mathematical  investigation.  The 
dawn  of  modern  mathematics  was  at  hand,  and  it  was  perhaps  inevitable 
that  sooner  or  later  some  aspect  of  the  axiomatic  method  itself  should  once 
again  claim  the  attention  of  researchers. 

PROBLEMS

2.1.1  Which  of  the  following  two  theorems  should  more  likely  appear 
among  the  “elements”  of  a  course  in  plane  geometry,  and  why?  (1)  The 
three  altitudes  of  a  triangle,  produced  if  necessary,  meet  in  a  point.  (2)  The 
sum  of  the  three  angles  of  a  triangle  is  equal  to  two  right  angles. 

2.1.2  A  mathematics  instructor  is  going  to  present  the  subject  of  geometric 
progressions to his college algebra class. After defining this type of progression,
what theorems about geometric progressions should the instructor
offer  as  the  “elements”  of  the  subject? 

2.1.3 Imagine yourself building up an elementary treatment of trigonometric
identities. Which identities would you select for the “elements” of
your  treatment,  and  in  what  order  would  you  arrange  them? 

2.1.4 As an illustration of nongeometrical material found in Euclid’s
Elements, let us consider the Euclidean algorism, or process, for finding the
greatest common integral divisor (g.c.d.) of two positive integers. The process
is found at the start of Euclid’s Book VII, although perhaps it was known
before Euclid’s time. This algorism is basic to several developments in modern
mathematics. Stated in the form of a rule, the process is this: Divide the larger
of the two positive integers by the smaller one. Then divide the divisor by the remainder.
Continue this process, of dividing the last divisor by the last remainder, until the
division is exact. The final divisor is the sought g.c.d. of the two original positive integers.

(a)  Find,  by  the  Euclidean  algorism,  the  g.c.d.  of  5913  and  7592. 

(b)  Find,  by  the  Euclidean  algorism,  the  g.c.d.  of  1827,  2523,  and  3248. 

(c)  Prove  that  the  Euclidean  algorism  does  lead  to  the  g.c.d. 

(d) Let h be the g.c.d. of the positive integers a and b. Show that there
exist integers p and q (not necessarily positive) such that pa + qb = h.

(e)  Find  p  and  q  for  the  integers  of  part  (a). 

(f) Prove that a and b are relatively prime if and only if there exist
integers p and q such that pa + qb = 1.
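
The rule stated in Problem 2.1.4 may be sketched in Python as follows. This is only an illustrative sketch: the names euclid_gcd and bezout are our own, the integers 1071 and 462 are chosen merely as an example, and the second function uses the usual extended form of the algorism, which is one possible way (not necessarily the one intended in part (d)) of exhibiting integers p and q.

```python
def euclid_gcd(a, b):
    """g.c.d. of two positive integers by the rule of Problem 2.1.4."""
    larger, smaller = max(a, b), min(a, b)
    # Divide the last divisor by the last remainder until the division is exact.
    while smaller != 0:
        larger, smaller = smaller, larger % smaller
    return larger  # the final divisor


def bezout(a, b):
    """Return (h, p, q) with h the g.c.d. of a and b and p*a + q*b == h."""
    old_r, r = a, b
    old_p, p = 1, 0
    old_q, q = 0, 1
    while r != 0:
        k = old_r // r
        old_r, r = r, old_r - k * r
        old_p, p = p, old_p - k * p
        old_q, q = q, old_q - k * q
    return old_r, old_p, old_q


h, p, q = bezout(1071, 462)
print(euclid_gcd(1071, 462), h, p, q)  # 21 21 -3 7
```

Here 1071 = 2(462) + 147, 462 = 3(147) + 21, and 147 = 7(21), so that the final divisor is 21, and (-3)(1071) + 7(462) = 21.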

2.1.5  (a)  Prove,  using  Problem  2.1.4  (f),  that  if  p  is  a  prime  and  divides 
the  product  uv  then  either  p  divides  u  or  p  divides  v. 

(b)  Prove,  from  part  (a),  the  “fundamental  theorem  of  arithmetic”: 
Every  integer  greater  than  1  can  be  uniquely  factored  into  a  product  of  primes.  This 
is  essentially  Proposition  IX  14  of  Euclid’s  Elements. 

2.1.6 The fundamental theorem of arithmetic says that, for any given
positive integer $a$, there are unique non-negative integers $a_1, a_2, a_3, \ldots$, only
a finite number of which are different from zero, such that

$$a = 2^{a_1} 3^{a_2} 5^{a_3} \cdots,$$

where 2, 3, 5, ... are the consecutive primes. This suggests a useful notation.
We shall write

$$a = (a_1, a_2, a_3, \ldots, a_n),$$

where $a_n$ is the last nonzero exponent. Thus we have 12 = (2, 1), 14 =
(1, 0, 0, 1), 27 = (0, 3), and 360 = (3, 2, 1).
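
The notation just introduced can be tried out with a short Python sketch. The function names exponent_vector and divisor_count are our own, and the trial-division test for primes is chosen only for brevity; it is suitable for small integers alone.

```python
import math


def exponent_vector(a):
    """Exponents of the consecutive primes 2, 3, 5, ... in a, up to the last nonzero one."""
    if a < 1:
        raise ValueError("a must be a positive integer")
    exponents = []
    p = 2
    while a > 1:
        # Trial division: p is prime if no q with 2 <= q <= sqrt(p) divides it.
        if all(p % q for q in range(2, math.isqrt(p) + 1)):
            e = 0
            while a % p == 0:
                a //= p
                e += 1
            exponents.append(e)
        p += 1
    return tuple(exponents)


def divisor_count(a):
    """Product of (exponent + 1) over the exponent vector, as in theorem (c) of this problem."""
    count = 1
    for e in exponent_vector(a):
        count *= e + 1
    return count


for n in (12, 14, 27, 360):
    print(n, exponent_vector(n))
```

For 12, 14, 27, and 360 the sketch reproduces the vectors (2, 1), (1, 0, 0, 1), (0, 3), and (3, 2, 1) given above.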

Prove  the  following  theorems: 

(a) $ab = (a_1 + b_1, a_2 + b_2, \ldots)$.

(b) $b$ is a divisor of $a$ if and only if $b_i \le a_i$ for each $i$.

(c) The number of divisors of $a$ is $(a_1 + 1)(a_2 + 1) \cdots (a_n + 1)$.

(d) A necessary and sufficient condition for a number n to be a perfect
square is that the number of divisors of n be odd.

(e) Set $g_i$ equal to the smaller of $a_i$ and $b_i$ if $a_i \ne b_i$ and equal to either
$a_i$ or $b_i$ if $a_i = b_i$. Then $g = (g_1, g_2, \ldots)$ is the g.c.d. of $a$ and $b$.

(f) If a and b are relatively prime and b divides ac, then b divides c.

(g) If a and b are relatively prime and if a divides c and b divides c,
then ab divides c.

(h) Show that $\sqrt{2}$ and $\sqrt{3}$ are irrational.

2.1.7 Prove the famous Proposition IX 20 of Euclid’s Elements: The number
of prime numbers is infinite.

2.1.8 A number is said to be perfect if it is the sum of its proper divisors.
For example, 6 is a perfect number, since 6 = 1 + 2 + 3. The last proposition
of the ninth book of Euclid’s Elements proves that if $2^n - 1$ is a prime
number, then $2^{n-1}(2^n - 1)$ is a perfect number. The perfect numbers given by
Euclid’s formula are even numbers, and it has been shown that every even
perfect number must be of this form. The existence or nonexistence of odd
perfect numbers is one of the celebrated unsolved problems in number theory.
There is no number of this type less than a trillion.

(a)  Show  that  in  Euclid’s  formula  for  perfect  numbers,  n  must  be  prime. 

(b)  What  are  the  first  four  perfect  numbers  given  by  Euclid’s  formula? 

(c)  Prove  that  the  sum  of  the  reciprocals  of  all  the  divisors  of  a  perfect 
number  is  equal  to  2. 
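
The definitions of Problem 2.1.8 lend themselves to a small numerical check, sketched here in Python. The function names are our own, and the brute-force search for divisors is meant only for small numbers; the sketch illustrates the statements and is no substitute for the proofs requested.

```python
from fractions import Fraction


def proper_divisors(n):
    """All positive divisors of n other than n itself."""
    return [d for d in range(1, n) if n % d == 0]


def is_perfect(n):
    """True if n equals the sum of its proper divisors."""
    return n > 1 and sum(proper_divisors(n)) == n


def euclid_formula(n):
    """The number 2^(n-1) * (2^n - 1); perfect when 2^n - 1 is prime."""
    return 2 ** (n - 1) * (2 ** n - 1)


def reciprocal_divisor_sum(n):
    """Exact sum of the reciprocals of all divisors of n, including n."""
    return sum(Fraction(1, d) for d in range(1, n + 1) if n % d == 0)


print(euclid_formula(2), is_perfect(euclid_formula(2)))  # 6 True
print(euclid_formula(4), is_perfect(euclid_formula(4)))  # 120 False
print(reciprocal_divisor_sum(6))                         # 2
```

For n = 4 the number 2^4 - 1 = 15 is not prime, and the formula yields 120, which is not perfect.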

2.2.1  Discuss  Euclid’s  axioms  and  postulates,  as  listed  in  Section  2.3,  in 
relation  to  the  three  distinctions  that,  according  to  Proclus,  were  advocated 
by  various  early  Greeks. 

2.3.1  How  does  the  modern  definition  of  a  circle  differ  from  Euclid’s 
definition? 

2.3.2  “Prove”  Euclid’s  Postulate  4  by  the  method  of  superposition. 

2.3.3  One  should  understand  precisely  the  intention  of  Euclid’s  Postulate  3. 
When Euclid says that “a circle may be described with any center and distance,”
he means that a circle may be described with any point as center and having
any  straight  line  segment  radiating  from  this  center  as  a  radius.  It  follows  that  the 
Euclidean  compasses  differ  from  our  modern  compasses,  for  with  the  modern 
compasses  we  are  permitted  to  draw  a  circle  having  any  point  A  as  center 
and  any  segment  BC  as  radius.  In  other  words,  we  are  permitted  to  transfer 
the  distance  BC  to  the  center  A,  using  the  compasses  as  dividers.  The 
Euclidean  compasses,  on  the  other  hand,  may  be  supposed  to  collapse  if 
either  leg  is  lifted  from  the  paper. 

A  student  reading  Euclid’s  Elements  for  the  first  time  might  experience 
surprise  at  the  opening  propositions  of  Book  I.  The  first  three  propositions 
are  the  construction  problems: 

(1)  To  describe  an  equilateral  triangle  upon  a  given  finite  straight  line. 

(2)  From  a  given  point  to  draw  a  straight  line  equal  to  a  given  straight  line. 

(3) From the greater of two given straight lines to cut off a part equal to the lesser.

These three constructions are trivial with straightedge and modern compasses,
but require some ingenuity with straightedge and Euclidean compasses.

(a)  Solve  Proposition  1  of  Book  I  with  Euclidean  tools. 

(b)  Solve  Proposition  2  of  Book  I  with  Euclidean  tools. 

(c)  Solve  Proposition  3  of  Book  I  with  Euclidean  tools. 

(d) Show that Proposition 2 of Book I proves that the straightedge and
Euclidean compasses are equivalent to the straightedge and modern compasses.

FIG. 11    FIG. 12

2.4.1  If  an  assumption  tacitly  made  in  a  deductive  development  should 
involve  a  misconception,  its  introduction  may  lead  not  only  to  a  result  which 
does  not  follow  from  the  postulates  of  the  deductive  system  but  to  one  which 
may  actually  contradict  some  previously  established  theorem  of  the  system. 
From  this  point  of  view,  criticize  the  following  three  geometrical  paradoxes: 

(a)  To  prove  that  any  triangle  is  isosceles. 

Let ABC be any triangle (see Figure 11). Draw the bisector of ∠C and
the perpendicular bisector of side AB. From their point of intersection E,
drop perpendiculars EF and EG on AC and BC, respectively, and draw EA
and EB. Now right triangles CFE and CGE are congruent, since each has
CE as hypotenuse and since ∠FCE = ∠GCE. Therefore CF = CG. Again,
right triangles EFA and EGB are congruent, since leg EF of one equals
leg EG of the other (any point E on the bisector of an angle C is equidistant
from the sides of the angle) and since hypotenuse EA of one equals hypotenuse
EB of the other (any point E on the perpendicular bisector of a line segment
AB is equidistant from the extremities of that line segment). Therefore
FA = GB. It now follows that CF + FA = CG + GB, or CA = CB, and
the triangle is isosceles.

(b)  To  prove  that  a  right  angle  is  equal  to  an  obtuse  angle. 

Let ABCD be any rectangle (see Figure 12). Draw BE outside the rectangle
and equal in length to BC, and hence to AD. Draw the perpendicular
bisectors of DE and AB; since they are perpendicular to nonparallel lines,
they must intersect in a point P. Draw AP, BP, DP, EP. Then PA = PB
and PD = PE (any point on the perpendicular bisector of a line segment
is equidistant from the extremities of the line segment). Also, by construction,
AD = BE. Therefore triangles APD and BPE are congruent, since the three
sides of one are equal to the three sides of the other. Hence ∠DAP = ∠EBP.
But ∠BAP = ∠ABP, since these angles are base angles of the isosceles
triangle APB. By subtraction it now follows that right angle DAB = obtuse
angle EBA.

(c) To prove that there are two perpendiculars from a point to a line.

Let  any  two  circles  intersect  in  A  and  B  (see  Figure  13).  Draw  the 
diameters  AC  and  AD,  and  let  the  join  of  C  and  D  cut  the  respective  circles 
in  M  and  N.  Then  angles  AMC  and  AND  are  right  angles,  since  each  is 
inscribed  in  a  semicircle.  Hence  AM  and  AN  are  two  perpendiculars  to  CD. 

2.4.2  To  guarantee  the  existence  of  certain  points  of  intersection  (of  line 
with circle and circle with circle) Richard Dedekind (1831-1916) introduced
into  geometry  the  following  continuity  postulate:  If  all  points  of  a  straight  line 
fall  into  two  classes,  such  that  every  point  of  the  first  class  lies  to  the  left  of  every  point 
of  the  second  class,  then  there  exists  one  and  only  one  point  which  produces  this  division 
of  all  points  into  two  classes,  that  is,  this  severing  of  the  straight  line  into  two  portions. 

(a)  Complete  the  details  of  the  following  indicated  proof  of  the  theorem: 
The  straight  line  segment  joining  a  point  A  inside  a  circle  to  a  point  B  outside  the 
circle  has  a  point  in  common  with  the  circle. 

Let O be the center and r the radius of the given circle (see Figure 14),
and let C be the foot of the perpendicular from O on the line determined
by A and B. The points of the segment AB can be divided into two classes:
those points P for which OP < r and those points Q for which OQ ≥ r.
It can be shown that, in every case, CP < CQ. Hence, by Dedekind’s
postulate, there exists a point R of AB such that all points which precede it
belong to one class and all which follow it belong to the other class. Now
OR is not less than r, for otherwise we could choose S on AB, between R and B,
such that RS < r − OR. But, since OS ≤ OR + RS, this would imply the absurdity
that OS < r. Similarly, it can be shown that OR is not greater than r. Hence we
must have OR = r, and the theorem is established.

(b)  How  might  Dedekind’s  postulate  be  extended  to  cover  angles? 

(c)  How  might  Dedekind’s  postulate  be  extended  to  cover  circular  arcs? 

2.4.3  Let  us,  for  convenience,  restate  Euclid’s  first  three  postulates  in  the 
following  equivalent  forms: 

(1)  Any  two  distinct  points  determine  a  straight  line. 

(2)  A  straight  line  is  boundless. 

(3)  There  exists  a  circle  having  any  given  point  as  center  and  passing  through 
any  second  given  point. 
Show that Euclid’s postulates, partially restated above, hold if the points
of  the  plane  are  restricted  to  those  whose  rectangular  Cartesian  coordinates 
for  some  fixed  frame  of  reference  are  rational  numbers.  Show,  however,  that 
under  this  restriction  a  circle  and  a  line  through  its  center  need  not  intersect 
each  other. 

2.4.4  Show  that  Euclid’s  postulates  (as  partially  restated  in  Problem  2.4.3) 
hold  if  we  interpret  the  plane  as  the  surface  of  a  sphere,  straight  lines  as 
great  circles  on  the  sphere,  and  points  as  points  on  the  sphere.  Show,  however, 
that  in  this  interpretation  the  following  are  true: 

(a)  Parallel  lines  do  not  exist. 

(b)  All  perpendiculars  to  a  given  line  erected  on  one  side  of  the  line 
intersect  in  a  point. 

(c)  It  is  possible  to  have  two  distinct  lines  joining  the  same  two  points. 

(d)  The  sum  of  the  angles  of  a  triangle  exceeds  two  right  angles. 

(e)  There  exist  triangles  having  all  three  angles  right  angles. 

(f) An exterior angle of a triangle is not always greater than either of
the  two  remote  interior  angles. 

(g)  The  sum  of  two  sides  of  a  triangle  can  be  less  than  the  third  side. 

(h)  A  triangle  with  a  pair  of  equal  angles  may  have  the  sides  opposite 
them  unequal. 

(i)  The  greatest  side  of  a  triangle  does  not  necessarily  lie  opposite  the 
greatest  angle  of  the  triangle. 
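
Statements (d) and (e) of Problem 2.4.4 can be illustrated numerically. The following Python sketch is offered under assumptions of our own: the sphere is taken to be the unit sphere centered at the origin, the triangle chosen is the one with vertices on the three positive coordinate axes, and the helper names cross, dot, and vertex_angle are hypothetical. The angle at a vertex is computed as the angle between the planes of the two great circles meeting there.

```python
import math


def cross(u, v):
    """Vector product of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def dot(u, v):
    """Scalar product of two 3-vectors."""
    return sum(x * y for x, y in zip(u, v))


def vertex_angle(a, b, c):
    """Angle in degrees at vertex a of the spherical triangle abc (unit vectors)."""
    n1 = cross(a, b)  # normal to the plane of the great circle through a and b
    n2 = cross(a, c)  # normal to the plane of the great circle through a and c
    cos_angle = dot(n1, n2) / math.sqrt(dot(n1, n1) * dot(n2, n2))
    cos_angle = max(-1.0, min(1.0, cos_angle))  # guard against rounding
    return math.degrees(math.acos(cos_angle))


A, B, C = (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)
angles = [vertex_angle(A, B, C), vertex_angle(B, C, A), vertex_angle(C, A, B)]
print(angles, sum(angles))
```

Each of the three angles of this triangle comes out as a right angle, so that their sum is three right angles.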

2.4.5 In 1882 Moritz Pasch formulated the following postulate: Let A, B, C
be three points not lying in the same straight line, and let m be a straight line lying in
the plane of ABC and not passing through any of the points A, B, C. Then, if the
line m passes through a point of the segment AB, it will also pass through a point
of the segment BC or a point of the segment AC. This postulate is one of those
assumptions classified by modern geometers as a postulate of order, and it
assists in bringing out the idea of “betweenness.”

(a) Prove, as a consequence of Pasch’s postulate, that if a line enters a
triangle at a vertex, it must cut the opposite side.

(b) Show that Pasch’s postulate does not always hold for a spherical
triangle cut by a great circle.

2.4.6  New  terms  defined  by  means  of  more  primitive  terms  are  not  essential 
to  a  deductive  system  but  are  convenient  in  the  drawing  of  inferences,  for 
the  new  terms  serve  as  shorthand  for  complex  and  unmanageable  groups  of 
the  more  primitive  terms.  To  illustrate  the  cumbersomeness  that  would 
result if, in Euclid’s Elements, we should dispense with, say, the terms point
and  line,  we  would  have  to  describe  a  straight  line  as  “a  breadthless  length 
which  lies  evenly  with  all  the  entities  on  itself  which  have  no  parts.”  State 
Euclid’s  first  two  postulates  without  using  the  terms  point  and  line. 

2.5.1 Let $W_1$ and $W_2$ be two weights at distances $d_1$ and $d_2$, respectively,
from a fulcrum. On the basis of the first two postulates of Archimedes’
treatise On Plane Equilibriums, establish the following theorems:

(a) If we have equilibrium and $W_1 = W_2$, then $d_1 = d_2$.

(b) If we have equilibrium and $d_1 = d_2$, then $W_1 = W_2$.

(c) If we have equilibrium and $W_1 \ne W_2$, then $d_1 \ne d_2$.

(d) If, when weights at certain distances balance, one of the distances
should be increased, equilibrium will not be maintained, but there will be
inclination on the side of the distance which was increased.

(e) Try to show that we have equilibrium if and only if $W_1 d_1 = W_2 d_2$.

2.5.2 (a) Assuming the existence of the usual one-to-one correspondence
between real numbers and points on an x-axis, show that an arithmetized
form of the postulate of Archimedes is, “If a and b are any two positive real
numbers, there exists a positive integer n such that na > b.”

(b)  State  the  postulate  of  Archimedes  for  angles,  and  indicate  how  it 
might  be  deduced  from  the  arithmetized  form  of  the  postulate. 
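
For rational a and b the arithmetized postulate of Problem 2.5.2 can be made concrete by actually producing such an integer n. The Python sketch below is restricted to rational numbers so that the arithmetic is exact; the function name archimedean_multiple is our own, and the choice of the least suitable n is merely a convenience, since the postulate asserts only that some n exists.

```python
import math
from fractions import Fraction


def archimedean_multiple(a, b):
    """Least positive integer n with n*a > b, for positive rationals a and b."""
    a, b = Fraction(a), Fraction(b)
    if a <= 0 or b <= 0:
        raise ValueError("a and b must be positive")
    # floor(b/a) + 1 is the least integer strictly greater than b/a.
    return math.floor(b / a) + 1


a, b = Fraction(1, 3), Fraction(5)
n = archimedean_multiple(a, b)
print(n, n * a > b, (n - 1) * a > b)  # 16 True False
```

However small a may be in comparison with b, a sufficient number of copies of a laid end to end will exceed b.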


CHAPTER THREE

NON-EUCLIDEAN  GEOMETRY 


3.1  Euclid’s  Fifth  Postulate 

Postulate  5  of  Euclid’s  Elements  has  been  described  as  “perhaps  the 
most  famous  single  utterance  in  the  history  of  science.”  *  Certainly  it  has  been 
the  source  of  much  controversy,  and  the  dissatisfaction  of  mathematical 
scholars  with  its  statement  as  a  postulate  is  indicated  by  the  fact  that  many 
reputable  geometers  attempted  over  a  period  of  some  twenty  centuries  either 
to  prove  it  as  a  theorem  or  to  replace  it  by  a  more  acceptable  equivalent. 
As  we  shall  soon  see,  this  concern  over  Euclid’s  fifth  postulate  furnished  the 
stimulus  for  the  development  of  a  great  deal  of  modern  mathematics,  and 
also  led  to  deep  and  revealing  inquiries  into  the  logical  and  philosophical 
foundations  of  the  subject. 

A  rigorous  development  of  the  theory  of  parallels  apparently  gave  the 
early  Greeks  considerable  trouble.  Euclid  met  the  difficulties  by  defining 
parallel  lines  as  coplanar  straight  lines  which  do  not  meet  one  another 
however  far  they  may  be  produced  in  either  direction,  and  by  adopting  as  an 
assumption  his  now  famous  fifth  postulate.  Proclus  tells  us  that  this  postulate 
was  attacked  from  the  very  start.  Even  a  cursory  reading  of  Euclid’s  five 

*C. J. Keyser, Mathematical Philosophy (New York: E. P. Dutton and Company, Inc.,
1922), p. 113.
postulates discloses a very noticeable difference between the fifth postulate
and  the  other  four;  the  fifth  postulate  lacks  the  terseness  and  the  simple 
comprehensibility  possessed  by  the  other  four,  and  it  certainly  does  not  have 
that  quality  of  ready  acceptance  demanded  by  material  axiomatics.  A  more 
studied  examination  reveals  that  the  fifth  postulate  is  actually  the  converse 
of Proposition I 17.* It is not surprising that it seemed more like a proposition
than a postulate. Moreover, Euclid himself made no use of it until he reached
Proposition  I  29.  It  was  very  natural  to  wonder  whether  the  postulate  was 
really  needed  at  all,  and  to  think  that  perhaps  it  could  be  derived  as  a 
theorem  from  the  remaining  nine  “axioms”  and  “postulates,”  or,  at  least, 
that  it  could  be  replaced  by  a  more  acceptable  equivalent.  Proclus,  who  was 
under  the  illusion  that  he  possessed  a  proof  of  the  postulate,  favored  deleting 
the  postulate  from  the  first  principles.  A  quotation  from  Proclus  might  be 
of  interest: 

This  ought  to  be  struck  out  of  the  postulates  altogether;  for  it  is  a  theorem 
involving  many  difficulties.  Ptolemy,  in  a  certain  book,  set  himself  to  solve  it,  and 
it  requires  for  its  demonstration  a  number  of  definitions  as  well  as  theorems. 
Moreover,  its  converse  is  actually  proved  by  Euclid  himself  as  a  theorem.  It  may 
be  that  some  persons  would  be  deceived,  and  would  think  it  proper  to  place  the 
assumption in question among the postulates as affording ground for an instantaneous
belief that the straight lines converge and meet when the two angles are
made  less  than  two  right  angles.  To  such  persons,  Geminus  correctly  replied  that 
we  have  learned  from  the  very  pioneers  of  this  science  not  to  have  any  regard 
for  mere  plausible  imaginings  when  it  is  really  a  question  of  the  reasonings  to  be 
included  in  our  geometrical  doctrine.  Aristotle  says  that  it  is  as  justifiable  to  ask 
scientific  proofs  of  a  rhetorician  as  to  accept  mere  plausibilities  from  a  geometer, 
and  Simmias  is  made  by  Plato  to  say  that  he  recognizes  as  quacks  those  who 
fashion  for  themselves  proofs  from  probabilities.  So  in  this  case,  when  the  two 
right  angles  are  lessened,  the  fact  that  the  straight  lines  converge  is  true  and 
necessary;  but  the  statement  that  they  will  meet  sometime,  since  they  converge 
more  and  more  as  they  are  produced,  is  plausible,  but  it  is  not  necessary  in  the 
absence  of  some  argument  showing  that  this  is  true.  It  is  a  known  fact  that  some 
lines exist which approach each other indefinitely, but yet remain non-intersecting;
this seems improbable and paradoxical, but nevertheless it is true and
fully  ascertained  with  regard  to  other  species  of  lines.  May  not  the  same  thing 
which  happens  in  the  case  of  the  lines  referred  to  be  possible  in  the  case  of  straight 
lines?  Indeed,  until  the  statement  in  the  postulate  is  clinched  by  proof,  the  facts 
shown  in  the  case  of  other  lines  may  direct  our  imagination  the  opposite  way. 
Though  the  controversial  arguments  against  the  meeting  of  the  straight  lines 
should  contain  much  that  is  surprising,  is  that  not  all  the  more  reason  why  we 
should  expel  from  our  body  of  doctrine  this  merely  plausible  and  unreasoned 
hypothesis? 

*See Appendix, Section A.1, for the statements of the first 28 propositions of Euclid’s
Book  I. 
There were many attempts to “prove” the fifth, or parallel, postulate,
and  there  were  also  many  substitutes  devised  for  its  replacement.  Of  the 
various  substitutes,  the  one  most  commonly  favored  is  that  which  was  made 
well  known  in  modern  times  by  the  Scottish  physicist  and  mathematician, 
John  Playfair  (1748-1819),  although  this  particular  alternative  had  been 
used  by  others  and  had  even  been  stated  as  early  as  the  fifth  century  by 
Proclus.  It  is  the  substitute  usually  encountered  in  present-day  high  school 
geometry  texts,  namely:  Through  a  given  point  can  be  drawn  only  one  line  parallel 
to  a  given  line.*  Some  other  alternatives  for  the  parallel  postulate  that  have
been  either  proposed  or  tacitly  assumed  over  the  years  are  these:  (1)  (Posidonius
and  Geminus)  There  exists  a  pair  of  coplanar  straight  lines  everywhere  equally
distant  from  one  another.  (2)  (Wallis,  Saccheri,  Carnot,  and  Laplace)  There 
exists  a  pair  of  similar  noncongruent  triangles.  (3)  (Saccheri)  If  in  a  quadrilateral 
a  pair  of  opposite  sides  are  equal  and  if  the  angles  adjacent  to  a  third  side  are  right 
angles,  then  the  other  two  angles  are  also  right  angles.  (4)  (Lambert  and  Clairaut)
If  in  a  quadrilateral  three  angles  are  right  angles,  the  fourth  angle  is  also  a  right  angle.
(5)  (Legendre)  There  exists  at  least  one  triangle  having  the  sum  of  its  three  angles 
equal  to  two  right  angles.  (6)  (Legendre)  Through  any  point  within  an  angle  less 
than  60°  there  can  always  be  drawn  a  straight  line  intersecting  both  sides  of  the  angle. 
(7)  (Legendre  and  W.  Bolyai)  A  circle  can  be  passed  through  any  three  noncollinear 
points.  (8)  (Gauss)  There  is  no  upper  limit  to  the  area  of  a  triangle.  It  constitutes 
an  interesting  and  challenging  collection  of  exercises  for  the  student  to  try 
to  show  the  equivalence  of  these  alternatives  to  the  original  postulate  stated 
by  Euclid.  To  show  the  equivalence  of  Euclid’s  postulate  and  a  particular 
one  of  the  alternatives,  one  must  show  that  the  alternative  follows  as  a 
theorem  from  Euclid’s  assumptions,  and  also  that  Euclid’s  postulate  follows 
as  a  theorem  from  Euclid’s  system  of  assumptions  with  the  parallel  postulate 
replaced  by  the  considered  alternative. 
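
The two-way requirement just described can be written compactly. In the sketch below, the symbols N (Euclid's assumptions other than the parallel postulate), P (the parallel postulate itself), and Q (the proposed alternative) are notation introduced here for illustration; they do not appear in the original text.

```latex
\[
  Q \text{ is equivalent to } P \text{ relative to } N
  \quad\Longleftrightarrow\quad
  \bigl( N \cup \{P\} \vdash Q \bigr)
  \ \text{ and } \
  \bigl( N \cup \{Q\} \vdash P \bigr).
\]
```

The first half alone shows only that Q is a theorem of Euclidean geometry; it is the second half that makes Q a genuine substitute for the postulate.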

It  would  be  difficult  to  estimate  the  number  of  attempts  that  have  been 
made,  throughout  the  centuries,  to  deduce  Euclid’s  fifth  postulate  as  a 
consequence  of  the  other  Euclidean  assumptions,  either  explicitly  stated  or 
tacitly  implied.  All  these  attempts  ended  unsuccessfully,  and  most  of  them 
were  sooner  or  later  shown  to  rest  upon  an  assumption  equivalent  to  the 
postulate  itself.  The  earliest  effort,  of  which  we  are  today  aware,  to  prove  the 
postulate  was  made  by  Claudius  Ptolemy  (ca.  150  A.D.),  alluded  to  by
Proclus  in  the  quotation  given  above.  Claudius  Ptolemy  was  the  author  of
the  famous  and  very  influential  Almagest,  the  great  definitive  Greek  work  on
astronomy.  Proclus  exposed  the  fallacy  in  Ptolemy’s  attempt  by  showing  that 
Ptolemy  had  unwittingly  assumed  that  through  a  point  only  one  parallel  can 
be  drawn  to  a  given  line;  this  assumption  is  the  Playfair  equivalent  of  Euclid’s 

*Propositions  I  27  and  I  28  guarantee,  under  the  tacit  assumption  of  the  infinitude
of  straight  lines,  the  existence  of  at  least  one  parallel.

postulate.  Proclus  submitted  an  attempt  of  his  own,  but  his  “proof”  rests  on
the  assumption  that  parallel  lines  are  always  a  bounded  distance  apart,  and 
this  assumption  can  be  shown  to  imply  Euclid’s  fifth  postulate.  Among  the 
more  noteworthy  attempts  of  somewhat  later  times  is  one  made  in  the 
thirteenth  century  by  Nasir-ed-din  (1201-1272),  a  Persian  astronomer  and 
mathematician  who  compiled,  from  an  earlier  Arabic  translation,  an 
improved  edition  of  the  Elements,  and  who  wrote  a  treatise  on  Euclid’s 
postulates,  but  his  attempt,  too,  involves  a  tacit  assumption  equivalent  to 
the  postulate  being  “proved.” 

An  important  stimulus  to  the  development  of  geometry  in  western 
Europe  after  the  Renaissance  was  a  renewal  of  the  criticism  of  Euclid’s  fifth 
postulate.  Hardly  any  critical  comments  are  to  be  found  in  the  early  printed 
editions  of  the  Elements  made  at  the  end  of  the  fifteenth  century  and  at  the 
beginning  of  the  sixteenth  century.  However,  after  the  translation,  in  1533, 
of  Proclus’  Commentary  on  Euclid,  Book  I,  many  men  once  again  embarked
upon  a  critical  analysis  of  the  fifth  postulate.  For  example,  John  Wallis 
(1616-1703),  while  lecturing  at  Oxford  University,  became  interested  in  the 
work  of  Nasir-ed-din,  and  in  1663  offered  his  own  “proof”  of  the  parallel 
postulate,  but  this  attempt  involves  the  equivalent  assumption  that  similar 
noncongruent  triangles  exist.  So  it  was  with  all  the  many  attempts  to  derive 
Euclid’s  postulate  as  a  theorem;  each  attempt  involved  the  vitiating  circularity
of  assuming  something  equivalent  to  the  thing  being  established,  or
else  committed  some  other  form  of  fallacious  reasoning.  Most  of  this  vast
amount  of  work  is  of  little  real  importance  in  the  actual  evolution  of  mathematical
thought  until  we  come  to  the  remarkable  investigation  of  the  parallel
postulate  made  by  Girolamo  Saccheri  in  1733. 

3.2  Saccheri  and  the  Reductio  ad  Absurdum  Method

Every  student  of  elementary  geometry  has  encountered  the  so-called 
indirect,  or  reductio  ad  absurdum,*  method  of  proof.  It  is  a  powerful,  and  at
times  seemingly  indispensable,  method,  which  is  employed  frequently  by 
Euclid  in  his  Elements.  The  method,  it  will  be  recalled,  consists  of  assuming, 
by  way  of  hypothesis,  that  a  proposition  that  is  to  be  established  is  false; 
if  an  absurdity  follows,  one  concludes  that  the  hypothesis  is  untenable  and 
that  the  original  proposition  must  then  be  true.  It  was  this  method  of  proof
that  we  employed  in  Section  1.5  to  show  that  √2  is  irrational.

*In  a  more  refined  treatment  one  distinguishes  several  slight  variations  in  the  indirect 
method  of  proof,  and  then  it  is  customary  to  assign  the  technical  terminology,  reductio 
ad  absurdum ,  to  a  particular  one  of  these  variations.  We  do  not  make  this  refinement 
here.

To  illustrate  further  the  reductio  ad  absurdum  method,  let  us  briefly
consider  Euclid’s  Proposition  I  6,  the  first  proposition  in  the  Elements  established
by  this  type  of  proof.  We  wish  to  prove  the  theorem:  If  in  a  triangle
two  angles  are  equal  to  each  other,  then  the  two  sides  opposite  these  angles  are  also
equal  to  each  other.  Let  ABC  (Figure  15)  be  the  triangle,  and  suppose  ∠ABC  =
∠ACB.  We  wish  to  show  that  side  AB  =  side  AC.  Suppose  the  sides  AB
and  AC  are  not  equal  to  each  other.  Then  one  of  them,  say  AB,  is  greater
than  the  other,  and  we  may  mark  off  on  BA  a  segment  BD  equal  to  the
lesser  side  AC.  Now  in  triangles  ABC  and  DCB  we  have  CB  =  BC,  CA  =  BD,
∠BCA  =  ∠CBD.  It  follows  that  the  triangles  are  congruent.  But  this
conclusion  is  impossible,  since  triangle  DCB  is  only  a  part  of  triangle
ABC.  Our  hypothesis  that  AB  ≠  AC  has  led  to  an  absurd  situation,  and
hence  is  untenable.  We  must  conclude,  therefore,  that  AB  =  AC,  and  our 
theorem  is  established. 

It  is  not  difficult  to  see  that  the  reductio  ad  absurdum  method  rests  on  two 
cardinal  principles  of  classical  logic,  namely,  the  law  of  contradiction  and  the 
law  of  the  excluded  middle.  Somewhat  loosely  described,  the  law  of  contradiction
says  that  if  S  is  any  statement,  then  S  and  a  contradiction  (that  is,  any
denial)  of  S  cannot  both  hold,  and  the  law  of  the  excluded  middle  says  that 
either  S  or  some  denial  of  S  must  hold  (that  is,  there  is  no  third,  or  middle, 
possibility).  As  an  illustration,  suppose  S  is  the  statement,  “Archimedes  was
born  in  287  B.C.”  The  law  of  contradiction  then  asserts  that  Archimedes
cannot  have  been  born  in  both  287  B.C.  and  400  A.D.,  for  example,  and  the
law  of  the  excluded  middle  asserts  that  Archimedes  either  was  born  in
287  B.C.  or  he  was  not  born  in  287  B.C.  Now  let  S  be  the  statement  of  any
proposition  to  be  established  by  the  reductio  ad  absurdum  method,  for  example, 
the  statement  of  Proposition  I  6  above.  By  the  method,  we  set  about  and 
show  that  any  denial  of  S  implies  a  denial  of  some  previously  assumed  or
established  statement  T.  Now,  by  the  law  of  contradiction,  T  and  this  denial
of  T  cannot  both  be  true  (that  is,  cannot  both  follow  from  the  postulates). 
Since  T  is  true,  the  denial  of  T  is  then  false,  whence,  since  a  true  statement 
can  never  imply  a  false  one,  it  follows  that  any  denial  of  S  must  also  be  false. 
But,  by  the  law  of  the  excluded  middle,  either  S  is  true  or  some  denial  of  S 
is  true.  Since  any  denial  of  S  is  false,  it  follows  that  S  is  true,  and  our  proposition
is  established.
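
The argument of this paragraph can be laid out as a short formal schema. The numbering of the steps and the symbols for negation and conjunction are conventions adopted here for illustration, not notation used in the original text.

```latex
\begin{align*}
 &(1)\quad \neg S \Rightarrow \neg T
   && \text{established by the method}\\
 &(2)\quad T
   && \text{a postulate or an earlier theorem}\\
 &(3)\quad \neg\,(T \wedge \neg T)
   && \text{law of contradiction}\\
 &(4)\quad \neg S \text{ is false}
   && \text{from (1), (2), and (3)}\\
 &(5)\quad S \vee \neg S
   && \text{law of the excluded middle}\\
 &(6)\quad S
   && \text{from (4) and (5)}
\end{align*}
```

Step (5) is the only place where the law of the excluded middle enters, which is why a restriction on that law bears directly on the reductio ad absurdum method.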

The  law  of  contradiction  and  the  law  of  the  excluded  middle  have 
settled  so  deeply  into  the  warp  and  woof  of  human  thinking  that  it  is  difficult 
to  conceive  of  questioning  their  validity.  We  shall  see  later,  however,  that 
although  these  laws  are  usually  pronounced  as  universally  true,  some  sort  of 
limitation  must  be  made  concerning  their  applicability.  Indeed,  since  1912, 
some  mathematicians  have  felt  that  we  must  drastically  restrict  the  free  use 
of  the  law  of  the  excluded  middle  as  part  of  the  logical  machinery  used  in 
deducing  theorems  from  postulates.  But  more  of  this  in  its  proper  place. 
For  the  time  being  we  shall  accept  these  laws,  particularly  insofar  as  they 
apply  to  the  reductio  ad  absurdum  method  of  proof. 

One  concluding  remark  about  the  reductio  ad  absurdum  method  seems 
appropriate  here.  In  the  game  of  chess  a  gambit  is  one  of  various  possible 
openings  in  which  a  pawn  or  a  piece  is  risked  in  order  to  obtain  an 
advantageous  attack.  The  eminent  English  mathematician,  G.  H.  Hardy 
(1878-1948),  delightfully  pointed  out  that  reductio  ad  absurdum  “is  a  far  finer 
gambit  than  any  chess  gambit:  a  chess  player  may  offer  the  sacrifice  of  a 
pawn  or  even  a  piece,  but  a  mathematician  offers  the  game.”*  Reductio  ad 
absurdum  emerges  as  the  most  stupendous  gambit  conceivable. 

Reductio  ad  absurdum  certainly  constitutes  one  of  the  finest  weapons  in 
a  mathematician’s  armory  of  attack;  with  this  weapon  Girolamo  Saccheri 
in  1733  made  the  first  really  scientific  assault  on  the  problem  of  Euclid’s 
parallel  postulate. 

Little  is  known  of  Saccheri’s  life.  He  was  born  in  San  Remo  in  1667, 
showed  marked  precocity  as  a  youngster,  completed  his  novitiate  for  the 
Jesuit  Order  at  the  age  of  twenty-three,  and  then  spent  the  rest  of  his  life 
filling  a  succession  of  university  teaching  posts.  While  instructing  rhetoric, 
philosophy,  and  theology  at  a  Jesuit  College  in  Milan,  Saccheri  read  Euclid’s 
Elements  and  became  enamored  with  the  powerful  method  of  reductio  ad 
absurdum.  Later,  while  teaching  philosophy  at  Turin,  Saccheri  published  his 
Logica  demonstrativa,  in  which  the  chief  innovation  is  the  application  of  the
method  of  reductio  ad  absurdum  to  the  treatment  of  formal  logic.  Some  years 
after,  while  a  professor  of  mathematics  at  the  University  of  Pavia,  it  occurred 
to  Saccheri  to  apply  his  favorite  method  of  reductio  ad  absurdum  to  a  study  of 

*G.  H.  Hardy,  A  Mathematician's  Apology  (New  York:  Cambridge  University  Press,
1941),  p.  34.

Euclid’s  parallel  postulate.  He  was  well  prepared  for  the  task,  having  dealt
ably  in  his  earlier  work  on  logic  with  such  matters  as  definitions  and  postulates.
Also,  he  was  acquainted  with  the  work  of  others  regarding  the  parallel
postulate  and  had  succeeded  in  pointing  out  the  fallacies  in  the  attempts  of 
Nasir-ed-din  and  Wallis. 

Saccheri’s  effort  to  establish  Euclid’s  parallel  postulate  by  attempting 
to  institute  a  reductio  ad  absurdum  was  apparently  the  first  time  anyone  had 
conceived  the  idea  of  denying  the  postulate  and  of  studying  the  consequences 
of  a  contradiction  of  the  famous  assumption.  The  result  of  these  researches 
was  a  little  book  entitled  Euclides  ab  omni  naevo  vindicatus  (Euclid  Freed  of 
Every  Flaw),  which  was  printed  in  Milan  in  1733,  only  a  few  months  before 
the  author’s  death.  In  this  work  Saccheri  accepts  the  first  28  propositions 
of  Euclid’s  Elements,  which,  as  we  have  previously  stated,  do  not  require  the
fifth  postulate  for  their  proof.  With  the  aid  of  these  theorems  he  then  proceeds
to  study  the  isosceles  birectangle,  that  is,  a  quadrilateral  ABDC  in  which  (see
Figure  16)  AC  =  BD  and  the  angles  at  A  and  B  are  right  angles.  By  drawing 
the  diagonals  AD  and  BC  and  then  using  simple  congruence  theorems  (which 
are  found  among  Euclid’s  first  28  propositions),  Saccheri  easily  shows  that 
the  angles  at  C  and  D  are  equal  to  each  other.  But  nothing  can  be  ascertained 
in  regard  to  the  magnitude  of  these  angles.  Of  course,  as  a  consequence  of 
Euclid’s  fifth  postulate,  it  follows  that  these  angles  are  both  right  angles, 
but  the  assumption  of  this  postulate  is  not  to  be  employed.  As  a  result,  the 
two  angles  might  both  be  right  angles,  or  both  be  obtuse  angles,  or  both  be 
acute  angles.  Here  Saccheri  maintains  an  open  mind,  and  names  the  three 
possibilities:  the  hypothesis  of  the  right  angle,  the  hypothesis  of  the  obtuse  angle,
and  the  hypothesis  of  the  acute  angle.  The  plan  of  the  work  is  to  rule  out  the 
last  two  possibilities  by  showing  that  their  respective  assumptions  lead  to 
absurdities,  thus  leaving,  by  the  reductio  ad  absurdum  method,  the  first 
hypothesis.  But  this  hypothesis  can  be  shown  to  be  equivalent  to  Euclid’s 
fifth  postulate.  In  this  way  the  parallel  postulate  is  to  be  established  and  the 
blemish  of  its  assumption  by  Euclid  removed. 


The  task  of  eliminating  the  hypothesis  of  the  obtuse  angle  and  the 
hypothesis  of  the  acute  angle  turns  out  to  be  rather  arduous.  With  real
geometrical  skill  and  fine  logical  penetration,  Saccheri  establishes  a  number
of  theorems,  of  which  the  following  are  among  the  more  important: 

1.  If  one  of  the  hypotheses  is  true  for  a  single  isosceles  birectangular  quadrilateral,
it  is  true  for  every  such  quadrilateral. 

2.  On  the  hypothesis  of  the  right  angle,  the  obtuse  angle,  or  the  acute  angle,  the
sum  of  the  angles  of  a  triangle  is  respectively  equal  to,  greater  than,  or  less  than 
two  right  angles. 

3.  If  there  exists  a  single  triangle  for  which  the  sum  of  the  angles  is  equal  to,  greater 
than,  or  less  than  two  right  angles,  there  follows  the  truth  of  the  hypothesis  of 
the  right  angle,  the  obtuse  angle,  or  the  acute  angle. 

4.  On  the  hypothesis  of  the  right  angle  two  straight  lines  intersect,  except  in  the
one  case  in  which  a  transversal  cuts  them  under  equal  corresponding  angles.  On
the  hypothesis  of  the  obtuse  angle  two  straight  lines  always  intersect.  On  the
hypothesis  of  the  acute  angle  there  is  a  whole  pencil  of  lines  through  a  given  point
which  do  not  meet  a  given  straight  line.

5.  The  locus  of  the  extremity  of  a  perpendicular  of  constant  length  which  moves
with  its  other  end  on  a  fixed  straight  line  is  a  straight  line  on  the  hypothesis  of
the  right  angle,  a  curve  convex  to  the  fixed  line  on  the  hypothesis  of  the  obtuse
angle,  and  a  curve  concave  to  the  fixed  line  on  the  hypothesis  of  the  acute  angle.

After  establishing  a  chain  of  13  propositions,  Saccheri  manages  to
dispose  of  the  hypothesis  of  the  obtuse  angle,  but  in  so  doing  he  makes  the
same  tacit  assumption  that  Euclid  made  concerning  the  infinitude  of  the 
straight  line.  With  this  tacit  assumption  (introduced  by  using  Euclid’s 
Proposition  I  18,  which  depends  on  I  16)  Saccheri  shows  that  the  hypothesis
of  the  obtuse  angle  implies  Euclid’s  fifth  postulate,  which,  in  turn,  implies 
that  the  sum  of  the  angles  of  a  triangle  is  equal  to  two  right  angles.  But  this 
second  implication  contradicts  the  theorem  that,  on  the  hypothesis  of  the 
obtuse  angle,  the  sum  of  the  angles  of  a  triangle  is  greater  than  two  right 
angles. 

The  case  of  the  hypothesis  of  the  acute  angle  proves  to  be  even  more 
stubborn,  and  Saccheri  requires  nearly  20  more  propositions  before  he  feels 
he  can  dispose  of  it.  After  obtaining  many  of  the  theorems  that  were  later 
to  become  classical  in  non-Euclidean  geometry,  Saccheri  weakly  forces  into 
his  development  an  unconvincing  contradiction  involving  vague  concepts 
about  elements  at  infinity.  The  contradiction  that  he  reaches  is  that  there 
exist  two  straight  lines  which  when  produced  to  infinity  merge  into  one 
another  and  there  have  a  common  perpendicular.  Coming  after  the  careful 
work  that  has  been  presented  up  to  this  point,  it  is  difficult  to  believe  that 
Saccheri  himself  was  really  convinced  by  his  lame  ending.  Indeed,  in  a 
second  part  to  his  work,  he  attempted,  with  no  greater  success,  a  second 
attack  on  the  hypothesis  of  the  acute  angle.  Had  Saccheri  not  been  so  eager 
to  exhibit  a  contradiction,  but  rather  had  boldly  admitted  his  inability  to 
find  one,  he  would  today  unquestionably  be  credited  with  the  discovery  of 
non-Euclidean  geometry.

It  is  difficult  to  evaluate  the  influence  that  Saccheri’s  work  may  have
had  on  later  researches  connected  with  the  parallel  postulate,  for  subsequently
his  little  publication  was  lost  for  a  long  time.  It  was  dramatically
resurrected  in  1889  by  Saccheri’s  compatriot,  Eugenio  Beltrami  (1835-1900), 
a  mathematician  who,  as  we  shall  soon  see,  made  notable  contributions  of 
his  own  to  the  subject  of  non-Euclidean  geometry.  The  first  part  of  Saccheri’s 
work  has  been  translated  into  English*  and  can  be  easily  read  by  any  student 
of  elementary  plane  geometry. 


3.3  The  Work  of  Lambert  and  Legendre

In  1766,  thirty-three  years  after  Saccheri’s  publication,  the  German 
mathematician  Johann  Heinrich  Lambert  (1728-1777)  wrote  an  investigation
of  the  parallel  postulate  entitled  Die  Theorie  der  Parallellinien,  which,
however,  was  not  published  until  eleven  years  after  the  author’s  death. 
Lambert’s  treatise  is  in  three  parts.  The  first  part  considers  whether  Euclid’s 
fifth  postulate  can  be  proved  from  Euclid’s  other  assumptions  or  only  if 
additional  assumptions  are  made.  The  second  part  is  concerned  with  the 
reduction  of  the  parallel  postulate  to  various  equivalent  propositions.  It  is 
the  last  part  of  the  study  that  closely  resembles  the  earlier  work  by  Saccheri. 
Here  Lambert  chooses  as  a  fundamental  figure  the  trirectangle,  or  quadrilateral
containing  three  right  angles,  which  can  be  regarded  as  the  half  of  a  Saccheri 
isosceles  birectangle  formed  by  joining  the  midpoints  of  the  latter’s  bases. 
As  with  Saccheri,  three  hypotheses  arise,  according  to  whether  the  fourth 
angle  of  the  trirectangle  is  right,  obtuse,  or  acute. 

Lambert  went  considerably  beyond  Saccheri  in  deducing  propositions 
under  the  hypotheses  of  the  obtuse  and  acute  angles.  Thus,  not  only  did  he 
show  that  for  the  three  hypotheses  the  sum  of  the  angles  of  a  triangle  is 
equal  to,  greater  than,  or  less  than  two  right  angles,  respectively,  but  in 
addition  he  showed  that  the  excess  above  two  right  angles  in  the  hypothesis 
of  the  obtuse  angle,  or  the  deficiency  below  two  right  angles  in  the  hypothesis
of  the  acute  angle,  is  proportional  to  the  area  of  the  triangle.  This
result  led  him  to  observe  the  resemblance  to  spherical  geometry  of  the
geometry  following  from  the  hypothesis  of  the  obtuse  angle  (in  spherical
geometry  the  area  of  a  triangle  is  proportional  to  its  spherical  excess),  and
to  conjecture  that  the  geometry  following  from  the  hypothesis  of  the  acute
angle  could  perhaps  be  verified  on  a  sphere  of  imaginary  radius.
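
The spherical fact quoted in parentheses above can be checked numerically. The following is a minimal sketch, assuming a unit sphere and a particular family of triangles chosen here for convenience (one vertex at the pole, two on the equator); the function names are hypothetical and nothing in it comes from Lambert's treatise. The area is computed independently of the angles, as a fraction of a hemisphere, so that the comparison with the angular excess is not circular.

```python
import math


def _dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def _unit_tangent(a, b):
    """Unit tangent at point a of the great circle running from a toward b."""
    d = _dot(a, b)
    t = [bi - d * ai for ai, bi in zip(a, b)]
    norm = math.sqrt(_dot(t, t))
    return [ti / norm for ti in t]


def vertex_angle(a, b, c):
    """Angle at vertex a of the spherical triangle abc (a, b, c unit vectors)."""
    cos_angle = _dot(_unit_tangent(a, b), _unit_tangent(a, c))
    return math.acos(max(-1.0, min(1.0, cos_angle)))


def polar_triangle(phi):
    """Return (excess, area) for a triangle on the unit sphere with one vertex
    at the pole and two on the equator, phi radians of longitude apart
    (assumes 0 < phi < pi)."""
    pole = (0.0, 0.0, 1.0)
    p = (1.0, 0.0, 0.0)
    q = (math.cos(phi), math.sin(phi), 0.0)
    angle_sum = (vertex_angle(pole, p, q)
                 + vertex_angle(p, q, pole)
                 + vertex_angle(q, pole, p))
    excess = angle_sum - math.pi  # excess above two right angles, in radians
    # The triangle is the fraction phi / (2*pi) of a hemisphere of area 2*pi.
    area = (phi / (2.0 * math.pi)) * (2.0 * math.pi)
    return excess, area


if __name__ == "__main__":
    for phi in (0.3, 1.0, 2.0):
        excess, area = polar_triangle(phi)
        print(f"phi={phi:.1f}  excess={excess:.6f}  area={area:.6f}")
```

On a sphere of radius R the area scales by R squared while the excess is unchanged, so the constant of proportionality is R squared; the sketch fixes R = 1.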

Another  notable  discovery  made  by  Lambert  concerns  the  measurement 
of  lengths  in  the  two  geometries  which  follow  from  the  obtuse-angle  and 
acute-angle  hypotheses.  In  Euclidean  geometry,  because  similar  noncongruent

*See  G.  B.  Halsted  [2]  or  D.  E.  Smith  [2],  pp.  351-359  (Bibliography).

figures  exist,  lengths  can  be  measured  only  in  terms  of  some  arbitrary
unit  which  has  no  structural  connection  with  the  geometry.  Angles,  on  the 
other  hand,  possess  a  natural  unit  of  measure,  such  as  the  right  angle  or  the 
radian,  which  is  capable  of  geometrical  definition.  This  is  what  is  meant 
when  mathematicians  say  that  in  Euclidean  geometry  lengths  are  relative  but 
angles  are  absolute.  Lambert  discovered  that  under  the  hypotheses  of  the 
obtuse  and  acute  angles,  angles  are  still  absolute,  but  lengths  are  absolute 
also!  In  fact,  it  can  be  shown  for  these  geometries  that  for  every  angle  there
is  a  corresponding  line  segment,  so  that  to  a  natural  unit  of  measure  for 
angles  there  corresponds  a  natural  unit  of  measure  for  lengths. 
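
One way to see an angle determining a length is through equilateral triangles. The sketch below assumes the modern trigonometric formulas for surfaces of constant curvature +1 (the unit sphere, as a model of the obtuse-angle case) and -1 (the hyperbolic plane, as a model of the acute-angle case). These formulas are not in the original text and were not available to Lambert in this form; the function name and the `geometry` parameter are hypothetical. In both cases the angle of an equilateral triangle fixes its side, whereas in Euclidean geometry every equilateral triangle has 60° angles whatever its size.

```python
import math


def equilateral_side(angle, geometry):
    """Side length of the equilateral triangle whose three angles equal `angle`
    (radians), on a surface of curvature +1 ("spherical") or -1 ("hyperbolic").

    Derived from the angle form of the law of cosines with A = B = C:
        cos A = -cos^2 A + sin^2 A * cos(a)    (spherical)
        cos A = -cos^2 A + sin^2 A * cosh(a)   (hyperbolic)
    Both reduce to  cos(a) or cosh(a) = cos A / (1 - cos A).
    """
    c = math.cos(angle)
    ratio = c / (1.0 - c)
    if geometry == "hyperbolic":
        if not 0.0 < angle < math.pi / 3.0:
            raise ValueError("hyperbolic equilateral angles lie strictly between 0 and 60 degrees")
        return math.acosh(ratio)
    if geometry == "spherical":
        if not math.pi / 3.0 < angle < math.pi:
            raise ValueError("spherical equilateral angles lie strictly between 60 and 180 degrees")
        return math.acos(ratio)
    raise ValueError("geometry must be 'spherical' or 'hyperbolic'")


if __name__ == "__main__":
    for degrees in (30, 45, 59):
        side = equilateral_side(math.radians(degrees), "hyperbolic")
        print(f"hyperbolic, angle {degrees} deg: side = {side:.6f}")
    for degrees in (61, 90, 120):
        side = equilateral_side(math.radians(degrees), "spherical")
        print(f"spherical,  angle {degrees} deg: side = {side:.6f}")
```

Choosing a distinguished angle, such as half a right angle, therefore singles out a distinguished segment, which is the sense in which a natural unit of angle yields a natural unit of length.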

Lambert  eliminates  the  hypothesis  of  the  obtuse  angle  by  making  the 
same  tacit  assumption  that  Saccheri  made,  but  his  conclusions  with  regard 
to  the  hypothesis  of  the  acute  angle  are  indefinite  and  unsatisfactory.  Indeed, 
it  was  this  incomplete  and  unsettled  state  of  affairs  with  regard  to  the  acute 
hypothesis  that  held  Lambert  from  publishing  his  work,  with  the  result  that 
it  did  not  appear  until  friends  finally  put  it  through  the  press  after  his  death. 

Lambert  was  a  mathematician  of  high  quality.  As  the  son  of  a  poor 
tailor  he  was  largely  self-taught.  He  possessed  a  fine  imagination  and 
established  his  results  with  great  attention  to  rigor.  In  fact,  Lambert  was 
the  first  to  prove  rigorously  that  the  number  π  is  irrational.  He  showed  that
if  x  is  rational,  but  not  zero,  then  tan  x  cannot  be  rational;  since  tan  π/4  =  1,
it  follows  that  π/4,  or  π,  cannot  be  rational.  We  also  owe  to  Lambert  the
first  systematic  development  of  the  theory  of  hyperbolic  functions,  and, 
indeed,  our  present  notation  for  these  functions.  Lambert  was  a  many-sided 
scholar  who  contributed  to  the  mathematics  of  numerous  other  topics,  such 
as  descriptive  geometry,  the  determination  of  comet  orbits,  and  the  theory 
of  projections  employed  in  the  making  of  maps. 
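
The logical shape of Lambert's irrationality argument mentioned above is a contrapositive followed by one substitution. It is set out here as a sketch of the inference only; the proof of the first line, which is the hard part of Lambert's work, is not reproduced.

```latex
\begin{align*}
 & x \in \mathbb{Q},\ x \neq 0 \ \Longrightarrow\ \tan x \notin \mathbb{Q}
   && \text{(Lambert's result)}\\
 & \tan x \in \mathbb{Q},\ x \neq 0 \ \Longrightarrow\ x \notin \mathbb{Q}
   && \text{(contrapositive)}\\
 & \tan\tfrac{\pi}{4} = 1 \in \mathbb{Q}
   \ \Longrightarrow\ \tfrac{\pi}{4} \notin \mathbb{Q}
   \ \Longrightarrow\ \pi \notin \mathbb{Q}
   && \text{(take } x = \tfrac{\pi}{4}\text{)}
\end{align*}
```

The last implication holds because a rational value of π would make π/4 rational as well.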

A  third  distinguished  effort  to  establish  Euclid’s  parallel  postulate  by 
the  reductio  ad  absurdum  method  was  essayed,  over  a  long  period  of  years,
by  the  eminent  French  analyst  Adrien-Marie  Legendre  (1752-1833).  He
began  anew  and  considered  three  hypotheses  according  to  whether  the  sum 
of  the  angles  of  a  triangle  is  equal  to,  greater  than,  or  less  than  two  right 
angles.  Tacitly  assuming  the  infinitude  of  a  straight  line,  he  was  able  to 
eliminate  the  second  hypothesis,  but,  although  he  made  several  attempts, 
he  could  not  dispose  of  the  third  hypothesis.  These  various  endeavors  appear 
in  the  successive  editions  of  his  very  popular  Éléments  de  géométrie,*  which


*This  work  is  an  attempted  pedagogical  improvement  of  Euclid’s  Elements  made  by 
considerably  rearranging  and  simplifying  the  propositions.  The  work  won  high  regard
in  continental  Europe  and  was  so  favorably  received  in  the  United  States  that  it 
became  the  prototype  of  the  elementary  geometry  textbooks  in  this  country.  The  first 
English  translation  was  made  in  the  United  States  in  1819  by  John  Farrar  of  Harvard 
University.  The  next  English  translation  was  made  in  1822  by  the  famous  Scottish 
litterateur,  Thomas  Carlyle,  who  early  in  life  was  a  teacher  of  mathematics.  Carlyle’s
translation  ran  through  33  American  editions.

ran  from  a  first  edition  in  1794  to  a  twelfth  in  1823.  Legendre’s  first  effort
is  vitiated  by  the  assumption  that  the  choice  of  a  unit  of  length  will  not 
affect  the  correctness  of  his  propositions,  but  this,  of  course,  is  equivalent  to 
assuming  the  existence  of  similar  noncongruent  figures.  The  next  attempt  is 
vitiated  by  assuming  the  existence  of  a  circle  through  any  three  noncollinear 
points.  Later  Legendre  independently  observed  the  fact  already  discovered 
by  Lambert,  that,  under  the  third  hypothesis,  the  deficiency  of  the  sum 
of  the  angles  of  a  triangle  below  two  right  angles  is  proportional  to  the  area 
of  the  triangle.  Hence,  Legendre  reasoned,  if  by  starting  with  any  given 
triangle  one  could  obtain  another  triangle  containing  the  given  triangle  at 
least  twice,  then  the  deficiency  for  this  new  triangle  would  be  at  least  twice 
the  deficiency  for  the  given  triangle.  By  repeating  the  operation  a  sufficient 
number  of  times,  one  could  finally  end  with  a  triangle  whose  angle  sum  has 
become  negative,  a  situation  which  is  absurd.  But,  in  order  to  solve  the 
problem  of  obtaining  a  triangle  containing  a  given  triangle  at  least  twice, 
Legendre  found  he  had  to  assume  that  through  any  point  within  a  given 
angle  less  than  60°  there  can  always  be  drawn  a  straight  line  intersecting 
both  sides  of  the  angle,  and  this,  as  we  have  pointed  out,  is  equivalent  to 
Euclid’s  fifth  postulate.  Legendre  gave  an  elegant  proof  of  the  theorem, 
if  there  exists  a  single  triangle  having  the  sum  of  its  angles  equal  to  two  right  angles , 
then  the  sum  of  the  angles  of  every  triangle  is  equal  to  two  right  angles.  Although 
this  theorem  is  contained  in  the  results  given  by  Saccheri,  it  is  generally 
referred  to  as  Legendre's  second  theorem.  Legendre's  first  theorem  is  the  sum  of  the 
three  angles  of  a  triangle  cannot  be  greater  than  two  right  angles.  Of  course,  in 
proving  this  theorem,  Legendre  tacitly  assumed  the  infinitude  of  straight 
lines.  In  fact,  in  proving  both  his  first  and  second  theorems,  Legendre 
assumed  the  postulate  of  Archimedes.  Max  Dehn  (1878-1952)  has  shown 
that  this  assumption  is  unavoidable  in  proving  the  first  theorem  but  not 
necessary  in  proving  the  second. 
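
The arithmetic behind Legendre's doubling argument in the paragraph above is easy to make concrete. The sketch below is an illustration only, with a hypothetical function name: it takes the deficiency of a starting triangle in degrees and counts how many doublings are needed before the guaranteed deficiency exceeds 180°, at which point the angle sum of the enlarged triangle would have to be negative. Since each step at least doubles the deficiency, the count is an upper bound on the number of steps required.

```python
def doublings_until_absurd(defect_deg):
    """Number of doublings after which a starting deficiency of `defect_deg`
    degrees has grown beyond 180 degrees, forcing a negative angle sum."""
    if defect_deg <= 0:
        raise ValueError("the third hypothesis requires a positive deficiency")
    steps = 0
    defect = defect_deg
    while defect <= 180.0:
        defect *= 2.0  # each new triangle contains the previous one at least twice
        steps += 1
    return steps


if __name__ == "__main__":
    for d in (100.0, 1.0, 0.001):
        print(f"deficiency {d} deg -> {doublings_until_absurd(d)} doublings")
```

However small the initial deficiency, the loop terminates, which is why the whole weight of the argument falls on the construction of the larger triangle and hence on the assumption about lines through a point within an angle.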

Legendre’s  last  paper  on  parallels,  essentially  a  collection  of  his  earlier
efforts,  was  published  in  1833,  the  year  of  his  death.  He  perhaps  holds  the
record  for  persistence  in  attempting  to  prove  the  famous  postulate.  The  simple
and  straightforward  style  of  his  proofs,  widely  circulated  because  of  their
appearance  in  his  Éléments,  and  his  high  eminence  in  the  world  of  mathematics,
created  marked  popular  interest  in  the  parallel  postulate.  Actually,
however,  Legendre  had  scarcely  progressed  as  far  as  had  Saccheri  a  hundred 
years  earlier.  Moreover,  even  before  the  appearance  of  his  last  paper,  a 
Russian  mathematician,  separated  from  the  rest  of  the  scientific  world  by 
barriers  of  distance  and  language,  had  taken  a  most  significant  step,  the 
boldness  and  importance  of  which  were  far  to  transcend  anything  Legendre 
had  done  on  the  subject. 
3.4  The  Discovery*  of  Non-Euclidean  Geometry

We  have  seen  that,  in  spite  of  considerable  effort  exerted  over  a  long 
period  of  time,  no  one  was  able  to  find  a  contradiction  under  the  hypothesis 
of  the  acute  angle.  It  is  no  wonder  that  no  contradiction  was  found  under 
this  hypothesis,  for  it  is  now  known  that  the  geometry  developed  from  a 
certain  basic  set  of  assumptions  plus  the  acute  angle  hypothesis  is  as  consistent
as  the  Euclidean  geometry  developed  from  the  same  basic  set  of
assumptions  plus  the  hypothesis  of  the  right  angle.  In  other  words,  it  is  now 
known  that  the  parallel  postulate  cannot  be  deduced  as  a  theorem  from  the 
other  assumptions  of  Euclidean  geometry  but  is  independent  of  those  other 
assumptions.  It  took  unusual  imagination  to  entertain  such  a  possibility,  for 
the  human  mind  had  for  two  millennia  been  bound  by  the  prejudice  of  tradition
to  the  firm  belief  that  Euclid’s  geometry  was  most  certainly  the  only
true  one,  and  that  any  contrary  geometric  system  simply  could  not  be 
consistent. 

The  first  to  suspect  the  independence  of  the  parallel  postulate  were 
Carl  Friedrich  Gauss  (1777-1855)  of  Germany,  Johann  Bolyai  (1802-1860)
of  Hungary,  and  Nicolai  Ivanovitch  Lobachevsky  (1793-1856)  of  Russia. 
These  men  independently  approached  the  subject  through  the  Playfair  form 
of  the  parallel  postulate  by  considering  the  three  possibilities:  Through  a 
given  point  can  be  drawn  just  one  line,  no  line,  or  more  than  one  line  parallel 
(in  Euclid’s  sense)  to  a  given  line.  These  three  situations  are  equivalent, 
respectively,  to  the  hypotheses  of  the  right,  the  obtuse,  and  the  acute  angles. 
Since they assumed, as did their predecessors, the infinitude of a straight line, the
second case was easily eliminated. Inability to find a contradiction in the
third  case,  however,  led  each  of  the  three  mathematicians  to  suspect,  in  time, 
a  consistent  geometry  under  that  hypothesis,  and  each,  unaware  of  the 
work  of  the  other  two,  carried  out,  for  its  own  intrinsic  interest,  an  extensive 
development  of  the  resulting  new  geometry. 

Gauss  was  perhaps  the  first  person  really  to  anticipate  a  non-Euclidean 
geometry.  Although  he  meditated  a  good  deal  on  the  matter  from  very  early 
youth  on,  probably  not  until  his  late  twenties  did  he  begin  to  suspect  the 
parallel postulate to be independent of Euclid’s other assumptions. Unfortunately,
Gauss failed, throughout his life, to publish anything on the subject,
and  his  advanced  conclusions  are  known  to  us  only  through  copies  of 
letters  to  interested  friends,  a  couple  of  published  reviews  of  works  of  others, 
and some notes found among his papers after his death. Although he refrained
from publishing his own findings, he strove to encourage others to persist
in similar investigations, and it was he who called the new geometry
non-Euclidean.

*We are not here concerned with any philosophical distinction between discovery
and invention.

Apparently  the  next  person  to  anticipate  a  non-Euclidean  geometry 
was  Johann  Bolyai,  who  was  a  Hungarian  officer  in  the  Austrian  army 
and  the  son  of  the  mathematician  Wolfgang  Bolyai,  a  long-time  personal 
friend  of  Gauss.  The  younger  Bolyai  undoubtedly  received  considerable 
stimulus  for  his  study  from  his  father,  who  had  earlier  shown  an  interest 
in  the  problem  of  the  parallel  postulate.  As  early  as  1823  Johann  Bolyai 
began  to  understand  the  real  nature  of  the  problem  that  faced  him,  and  a 
letter  written  during  that  year  to  his  father  shows  the  enthusiasm  he  held 
for  his  work.  In  this  letter  he  discloses  a  resolution  to  publish  a  tract  on  the 
theory  of  parallels  as  soon  as  he  can  find  the  time  and  opportunity  to  put 
the  material  in  order,  and  exclaims,  “Out  of  nothing  I  have  created  a  strange 
new  universe.”  The  father  urged  that  the  proposed  tract  be  published  as  an 
appendix  to  his  own  large  two-volume  semiphilosophical  work  on  elementary 
mathematics.  The  expansion  and  arrangement  of  ideas  proceeded  more 
slowly  than  Johann  had  anticipated,  but  finally,  in  1829,  he  submitted  the 
finished  manuscript  to  his  father,  and  three  years  later,  in  1832,  the  tract 
appeared  as  a  twenty-six  page  appendix  to  the  first  volume  of  his  father’s 
work.*  Johann  Bolyai  never  published  anything  further,  although  he  did 
leave  behind  a  great  pile  of  manuscript  pages.  His  chief  interest  was  in  what 
he  called  “the  absolute  science  of  space,”  by  which  he  meant  the  collection 
of  those  propositions  which  are  independent  of  the  parallel  postulate  and 
which  consequently  hold  in  both  the  Euclidean  geometry  and  the  new 
geometry. For example, the familiar law of sines for triangle ABC,

$$a : b : c = \sin A : \sin B : \sin C,$$

holds only in the Euclidean geometry, but if modified to read

$$\bigcirc(a) : \bigcirc(b) : \bigcirc(c) = \sin A : \sin B : \sin C,$$

where $\bigcirc(r)$ denotes the circumference of a circle of radius r, then the modified
law  holds  in  both  of  the  geometries.  This,  then,  is  the  form  that  the  law  of 
sines  takes  in  Bolyai’s  work.  It  is  not  difficult  to  show  that  this  same  form  of 
the  sine  law  also  holds  in  the  geometry  of  triangles  on  a  sphere. 
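
As a worked illustration of the modified law, the lines below substitute the circumference of a circle of radius r in each of the three geometries. The circumference formulas are standard results that are not derived in this section, and they assume that lengths are measured in the unit for which the curvature constant of the Lobachevskian plane and the radius of the sphere both equal 1.

```latex
\begin{align*}
\text{Euclidean plane:} \quad & \bigcirc(r) = 2\pi r
  && \Longrightarrow \quad a : b : c = \sin A : \sin B : \sin C, \\
\text{Lobachevskian plane:} \quad & \bigcirc(r) = 2\pi \sinh r
  && \Longrightarrow \quad \sinh a : \sinh b : \sinh c = \sin A : \sin B : \sin C, \\
\text{unit sphere:} \quad & \bigcirc(r) = 2\pi \sin r
  && \Longrightarrow \quad \sin a : \sin b : \sin c = \sin A : \sin B : \sin C.
\end{align*}
```

In each case the common factor $2\pi$ cancels from the ratios, which is why a single statement in terms of circumferences covers all three geometries.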

*For a translation of this appendix, see R. Bonola, or D. E. Smith [2], pp. 375-388.

Although Gauss and Johann Bolyai are acknowledged to be the first
to conceive a non-Euclidean geometry, actually the Russian mathematician
Lobachevsky published the first really systematic development of the subject.
Lobachevsky spent the greater part of his life at the University of Kasan,
first as a student, later as a professor of mathematics, and finally as rector,
and his earliest paper on non-Euclidean geometry was published in 1829-1830
in the Kasan Bulletin, two to three years before Bolyai’s work appeared
in print. This memoir attracted only slight attention in Russia, and, because
it was written in Russian, practically no attention elsewhere. Lobachevsky
followed this initial effort with other presentations. For example, in the hope
of reaching a wider group of readers, he published, in 1840, a little book
written in German entitled Geometrische Untersuchungen zur Theorie der
Parallellinien (Geometrical Researches on the Theory of Parallels),* and then
still later, in 1855, a year before his death and after he had become blind,
he published in French a final and more condensed treatment entitled
Pangéométrie (Pangeometry).† So slowly did information of new discoveries
spread in those days that Gauss probably did not hear of Lobachevsky’s
work until the appearance of the German publication in 1840, and Johann
Bolyai was unaware of it until 1848. Lobachevsky himself did not live to see
his work accorded any wide recognition, but the non-Euclidean geometry
which he developed is nowadays frequently referred to as Lobachevskian
geometry.

*N. Lobachevsky.

†D. E. Smith [2], pp. 360-374.

The characterizing postulate of Lobachevskian geometry, which replaces
Euclid’s parallel postulate, is that through a given point P, not on a given line m,
more than one line can be drawn lying in the plane of P and m and not intersecting m.
On the basis of this postulate, together with the other assumptions of
Euclidean geometry, it is not difficult to show (see Figure 17) that there
are always two lines through P which do not intersect m, which make equal
acute angles $\alpha$ with the perpendicular PQ from P to m, and which are such
that any line through P lying within the angle formed by the two lines and
containing the perpendicular PQ intersects m, while any other line through
P does not intersect m. The two lines through P are, then, boundary lines
separating all lines through P into two classes, those which cut m and those
which do not. From the viewpoint of Euclid, all the lines through P which
do not cut m should be called parallels to m. Lobachevsky, however, uses
this term more reservedly, and refers to only the two boundary lines as
being parallel to m. Other lines passing through P but not cutting m may
be said to be hyperparallel to m. The acute angle $\alpha$ is called the angle of
parallelism, and it plays an important role in Lobachevsky’s development.
He shows that the size of $\alpha$ depends on the length h of the perpendicular
PQ, and emphasizes this by denoting $\alpha$ by the functional symbol $\Pi(h)$. In
fact, he shows that if the unit of length is chosen as the distance which
corresponds to the particular angle of parallelism

$$\alpha = 2 \arctan e^{-1},$$

where e is the base for natural logarithms, then

$$\Pi(h) = 2 \arctan e^{-h} \quad \text{and} \quad h = \ln \cot \frac{\Pi(h)}{2}.$$

We note that the angle of parallelism, $\Pi(h)$, increases from 0 to $\pi/2$ as h
decreases from $\infty$ to 0, so that, “in the small,” Lobachevskian geometry
approximates Euclidean geometry. Also, since to each angle $\Pi(h)$ is associated
a definite distance h, we see why distances, as well as angles, are
absolute in Lobachevskian geometry. It further turns out that the trigonometrical
formulas in Lobachevskian geometry are nothing but the familiar
formulas of spherical trigonometry when the sides a, b, c of the triangle are
replaced by a/i, b/i, c/i, and we are reminded of Lambert’s suggestion about
an imaginary sphere, mentioned in Section 3.3.
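
The behavior of the angle of parallelism can be checked numerically. The following is a minimal sketch, assuming the unit of length described above; the function names are illustrative and do not come from the text. It evaluates the angle 2 arctan e^(-h) for a distance h, together with the inverse relation h = ln cot(angle/2).

```python
import math


def angle_of_parallelism(h: float) -> float:
    """Return Pi(h) = 2 * arctan(e^(-h)) in radians, for a distance h >= 0."""
    return 2.0 * math.atan(math.exp(-h))


def distance_for_angle(alpha: float) -> float:
    """Return h = ln cot(alpha / 2), the inverse relation, for 0 < alpha <= pi/2."""
    return math.log(1.0 / math.tan(alpha / 2.0))


if __name__ == "__main__":
    # Tabulate the angle for a few sample distances (values chosen arbitrarily).
    for h in (0.001, 0.1, 1.0, 5.0):
        alpha = angle_of_parallelism(h)
        print(f"h = {h:6.3f}   Pi(h) = {math.degrees(alpha):8.4f} degrees")
```

For very small h the printed angle is close to 90 degrees, which is the numerical face of the remark that Lobachevskian geometry approximates Euclidean geometry "in the small"; for large h the angle shrinks toward 0.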

It  is  not  the  purpose  of  our  study  to  go  deeply  into  the  Lobachevskian 
non-Euclidean  geometry  resulting  from  the  hypothesis  of  the  acute  angle, 
and  perhaps  we  have  already  indicated  a  sufficient  number  of  propositions 
in  the  geometry  to  give  the  reader  some  idea  of  its  content.  We  have  seen  that 
the  hypothesis  of  the  obtuse  angle  was  discarded  by  all  who  did  research  in 
this  subject  because  it  contradicted  the  assumption  that  a  straight  line  is 
infinite  in  length.  Recognition  of  a  second  non-Euclidean  geometry,  based  on 
the  hypothesis  of  the  obtuse  angle,  was  not  fully  achieved  until  some  years 
later,  when  Bernhard  Riemann,  in  his  probationary  lecture  of  1854,  discussed 
the  concepts  of  boundlessness  and  infiniteness.  With  these  concepts  clarified, 
one  can  realize  an  equally  consistent  geometry  satisfying  the  hypothesis  of 
the  obtuse  angle  if  Euclid’s  Postulates  1,  2,  and  5  are  modified  to  read: 

(1') Two distinct points determine at least one straight line.

(2')  A  straight  line  is  boundless. 

(5')  Any  two  straight  lines  in  a  plane  intersect. 

Much  in  this  new  non-Euclidean  geometry  is  interesting.  For  example 
(see Figure 18), it can be shown, without great difficulty, that all the perpendiculars
erected on the same side of a given straight line m are concurrent
in a point O, and that the lengths along these perpendiculars from O to the
line m are all equal to one another. Moreover, this common length, which
we shall denote by q, is independent of which straight line in the plane is
chosen for m. It can also be shown that, if A, B, P are any three points on
line m, then

$$AP : AB = \angle AOP : \angle AOB,$$

and that if AB is taken equal in length to q, then $\angle AOB = \pi/2$. It now
follows that all straight lines are finite and of the same constant length 4q,
for we observe that OP coincides with OA when $\angle AOP = 2\pi$, so that, under
such circumstances, AP becomes the total length of the line m. But now

$$AP : AB = 2\pi : \angle AOB,$$

whence, by taking AB = q, and hence $\angle AOB = \pi/2$, we find that

$$AP = 4q.$$

Thus  straight  lines,  though  boundless,  are  finite  in  length.  Also,  as  in  the 
case  of  Lobachevskian  geometry,  lengths,  as  well  as  angles,  are  absolute. 
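
These relations can be checked against a concrete picture. The lines below are a sketch that assumes the sphere interpretation taken up in Section 3.5, with the radius R introduced here only for illustration: the line m is the equator, the point O is a pole, and the perpendiculars to m are the meridians.

```latex
\begin{align*}
q &= \tfrac{1}{4}(2\pi R) = \frac{\pi R}{2}
  && \text{(pole to equator along a meridian)}, \\
AP : AB &= R\,\angle AOP : R\,\angle AOB = \angle AOP : \angle AOB
  && \text{(arcs of the equator)}, \\
\text{length of } m &= 2\pi R = 4q.
\end{align*}
```

The common length q is the same for every great circle, in agreement with the statement that q does not depend on the choice of m.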

Riemann’s  celebrated  lecture*  of  1854  is  not  detailed  or  specific  in  its 
development  but  is  extraordinarily  rich  in  the  depth  and  generality  of  its 
concepts  and  in  the  originality  of  its  powerful  new  points  of  view.  It  would 
be  difficult  to  point  out  another  paper  which  has  so  greatly  influenced  modern 
geometrical research. This paper inaugurated a second period in the development
of non-Euclidean geometry, a period characterized by the employment
of the methods of differential geometry rather than the previously used
methods of elementary synthetic geometry. To this paper we owe a considerable
generalization of the concept of space which has led, in more recent
times,  to  the  extensive  and  important  theory  of  abstract  spaces;  some  of  this 
theory  has  found  application  in  the  physical  theory  of  relativity.  Literally 
volumes  of  modern  mathematical  research  can  be  traced  to  ideas  advanced 
in  this  remarkable  paper. 

*The lecture was published in 1866, shortly after Riemann’s death. For an English
translation, see D. E. Smith [2], pp. 411-425.

3.5  The Consistency and the Significance of Non-Euclidean Geometry

It was some years after the appearance of the work of Lobachevsky
and Bolyai that the mathematical world in general paid much attention to
the subject of non-Euclidean geometry, and several decades elapsed before
the full implication of the invention was appreciated. Most of the development
of the subject beyond the historical point to which we have carried it is
of  too  advanced  a  nature  to  be  adequately  considered  here.  There  is  one 
important  matter,  however,  in  this  later  development  that  must  be  at  least 
briefly  touched  upon.  Although  Lobachevsky  and  Bolyai  encountered  no 
contradiction  in  their  extensive  investigations  of  the  non-Euclidean  geometry 
based  upon  the  hypothesis  of  the  acute  angle,  and  although  they  even  felt 
confident that no contradiction would arise, there still remained the possibility
that such a contradiction or inconsistency might appear if the
investigations  should  be  sufficiently  continued.  To  Beltrami  goes  the  credit 
for  the  first  proof  of  the  consistency  of  this  non-Euclidean  geometry.  In  a 
brilliant  paper,*  published  in  1868,  Beltrami  showed  that  the  plane  non- 
Euclidean  geometry  of  Lobachevsky  and  Bolyai  can  be  represented,  with 
certain  restrictions,  on  a  surface  of  so-called  constant  negative  curvature. 
It  can  be  similarly  shown  that  the  plane  non-Euclidean  geometry  of  Riemann 
can  be  represented  on  a  surface  of  constant  positive  curvature.  Although 
Beltrami’s  methods  are  those  of  differential  geometry  and  cannot  be  fully 
appreciated  without  an  understanding  of  that  field  of  mathematics,  we  can 
rather  simply  explain  the  gist  of  his  idea. 

Of  the  surfaces  of  constant  positive  curvature,  the  simplest  is  the  sphere. 
Now  the  geodesics  on  the  sphere— that  is,  the  curves  of  shortest  length  lying 
on  the  sphere  and  joining  pairs  of  points  on  the  sphere— are  the  great  circles 
of  the  sphere.  If  we  should  interpret  the  plane  of  the  non-Euclidean  geometry 
of  Riemann  as  the  surface  of  a  sphere,  and  the  straight  lines  of  that  non- 
Euclidean  geometry  as  the  great  circles  on  the  sphere,  then  it  is  a  very 
simple  matter  to  show  that  the  postulates  of  the  non-Euclidean  geometry 
hold  in  our  interpretation.  For  example: 

(1') Two distinct points on the sphere determine at least one great circle
on  the  sphere.  (In  fact,  the  great  circle  is  unique,  unless  the  points  on 
the  sphere  happen  to  be  diametrically  opposite  to  each  other,  in  which 
case  any  number  of  great  circles  may  be  passed  through  the  two  points.) 

(2') A great circle on the sphere is boundless. (A great circle is not infinite
in  length,  however;  in  fact,  all  great  circles  on  the  sphere  have  the 
same  finite  length.) 

(3')  With  any  point  on  the  sphere  as  center  and  any  great  circle  arc  as 
polar  distance  a  circle  can  be  drawn  on  the  sphere. 

(4')  All  right  angles  on  the  sphere  are  equal  to  one  another. 

(5') Any two great circles on the sphere intersect.
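
Statements (1') and (5') can be made concrete with a little vector arithmetic. The following is a minimal sketch, assuming a unit sphere centered at the origin and representing each great circle by a unit normal to its plane; this representation and all function names are illustrative choices, not part of the text.

```python
import math
from typing import Tuple

Vec = Tuple[float, float, float]


def cross(u: Vec, v: Vec) -> Vec:
    """Vector (cross) product of u and v."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def dot(u: Vec, v: Vec) -> float:
    """Scalar (dot) product of u and v."""
    return u[0] * v[0] + u[1] * v[1] + u[2] * v[2]


def unit(u: Vec) -> Vec:
    """Scale u to length 1; a zero vector has no direction."""
    length = math.sqrt(dot(u, u))
    if length < 1e-12:
        raise ValueError("zero vector has no direction")
    return (u[0] / length, u[1] / length, u[2] / length)


def great_circle_through(p: Vec, q: Vec) -> Vec:
    """Unit normal of the plane of the great circle through p and q.

    Raises ValueError when p and q coincide or are diametrically opposite,
    since then no single great circle is determined.
    """
    return unit(cross(p, q))


def intersections(n1: Vec, n2: Vec) -> Tuple[Vec, Vec]:
    """The two antipodal points shared by two distinct great circles."""
    p = unit(cross(n1, n2))
    return p, (-p[0], -p[1], -p[2])
```

A point lies on a great circle exactly when it is perpendicular to that circle's normal, so the points perpendicular to both normals lie on both circles. Two distinct great circles therefore always meet, and in fact meet twice, at a pair of diametrically opposite points.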

In  view  of  our  success  in  finding  on  the  surface  of  a  sphere  a  representation 
of Riemann’s non-Euclidean geometry, it now follows that the plane non-Euclidean
geometry of Riemann is consistent if Euclidean geometry is consistent,
for if there were a deduced inconsistency in this plane non-Euclidean
geometry, there would be a corresponding deduced inconsistency in the
ordinary geometry of great circles on a sphere, and this geometry is a part
of the Euclidean geometry of space. What we have shown to be true of the
great circles on a sphere can, by the methods of differential geometry, be
shown to be true of the geodesics on any surface of constant positive curvature.

*E. Beltrami.

Just  as  the  plane  non-Euclidean  geometry  of  Riemann  can  be  realized 
on  a  surface  of  constant  positive  curvature,  so  also  can  the  plane  non- 
Euclidean  geometry  of  Lobachevsky  and  Bolyai  be  similarly  realized  on  a 
surface  of  constant  negative  curvature.  Perhaps  the  simplest  surface  of 
constant negative curvature is the pseudosphere, or tractoid. To define this
surface we first define a plane curve known as the tractrix. The tractrix may
be generated as follows: Imagine a piece of inextensible cord lying along
the positive y axis (see Figure 19), one end of the cord lying at the origin,
and the other end having attached to it a small heavy pellet. If the end
lying at the origin is now pulled along the x axis, the pellet will trace a
kind of curve of pursuit; this curve is the tractrix, as shown in Figure 19.
The curve is symmetrical in the y axis and has the x axis for an asymptote.
Now the pseudosphere is the surface of revolution obtained by revolving
the tractrix about its asymptote as an axis of rotation (see Figure 20). It
can be shown that the geometry of the geodesics on this surface satisfies the
postulates of the non-Euclidean geometry of Lobachevsky and Bolyai, after
there has been suitable particularization of terms, but the proof here is not
so simple as in the previous case of the sphere, and can perhaps best be
accomplished by the differential geometry methods employed by Beltrami
for the general surface of constant negative curvature. This model, or
representation, in Euclidean space, of the plane non-Euclidean geometry of
Lobachevsky and Bolyai shows that this plane non-Euclidean geometry,
too, is consistent if Euclidean geometry is consistent; for, once more, any
inconsistency in the plane non-Euclidean geometry would imply a corresponding
inconsistency in the Euclidean geometry of geodesics on the
pseudosphere.
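
The cord-and-pellet description of the tractrix can be tested numerically. The following is a minimal sketch, assuming a cord of length a and the standard parametrization x = a(t - tanh t), y = a sech t, which is not given in the text; the function names are illustrative. With this parametrization the pulled end of the cord is at the point (at, 0) on the x axis.

```python
import math
from typing import Tuple


def tractrix_point(t: float, a: float = 1.0) -> Tuple[float, float]:
    """Position of the pellet: x = a (t - tanh t), y = a / cosh t."""
    return (a * (t - math.tanh(t)), a / math.cosh(t))


def puller_position(t: float, a: float = 1.0) -> Tuple[float, float]:
    """Position of the pulled end of the cord on the x axis."""
    return (a * t, 0.0)


def cord_length(t: float, a: float = 1.0) -> float:
    """Distance from the pellet to the pulled end; should equal a."""
    x, y = tractrix_point(t, a)
    px, py = puller_position(t, a)
    return math.hypot(px - x, py - y)


def tangent_cross_cord(t: float, a: float = 1.0, eps: float = 1e-6) -> float:
    """Cross product of the curve's tangent with the cord direction.

    The tangent is estimated by a central difference. A value near zero
    means the pellet is moving along the taut cord.
    """
    x0, y0 = tractrix_point(t - eps, a)
    x1, y1 = tractrix_point(t + eps, a)
    dx, dy = (x1 - x0) / (2 * eps), (y1 - y0) / (2 * eps)
    x, y = tractrix_point(t, a)
    px, py = puller_position(t, a)
    return dx * (py - y) - dy * (px - x)
```

The two checks together express the generating rule: the cord keeps a constant length and always lies along the tangent to the curve. Negative values of t give the mirror-image branch, in keeping with the symmetry in the y axis.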

The  pseudosphere  and  its  geodesics,  considered  as  a  representation  of 
the  plane  non-Euclidean  geometry  of  Lobachevsky  and  Bolyai,  is  not  as 
satisfactory  as  the  sphere  and  its  geodesics,  considered  as  a  representation  of 
the  plane  non-Euclidean  geometry  of  Riemann,  for  the  pseudosphere 
represents  only  a  limited  part  of  the  one  non-Euclidean  plane,  whereas  the 
sphere  represents  the  whole  of  the  other  non-Euclidean  plane.  Beltrami 
conjectured,  and  it  has  since  been  proved,  that  there  is  no  surface  of  constant 
negative  curvature  which  can  represent  the  entire  plane  in  the  first  case. 
Also,  it  is  to  be  noted  that  neither  of  our  representations  takes  into  account 
any  solid  non-Euclidean  geometry. 

After  the  discovery  of  the  above  representations  of  the  two  classical 
non-Euclidean  geometries  on  surfaces  of  constant  curvature,  many  other, 
and  in  some  ways  more  satisfying,  representations  of  a  different  nature  were 
devised.  In  Chapter  4  we  shall  examine  an  elementary  representation  of 
Lobachevskian  non-Euclidean  geometry  that  was  devised  by  the  great  French 
mathematician Henri Poincaré (1854-1912). It is important to observe that
this method of models, or representations, does not establish absolute consistency
but merely relative consistency. All we can assert from such models
is  that  the  two  classical  non-Euclidean  geometries  are  consistent  if  Euclidean 
geometry  is  consistent.  The  possibility  of  an  absolute  test  of  consistency  for 
a  postulate  set  will  be  considered  in  a  later  chapter. 

One  consequence  of  the  consistency  of  the  non-Euclidean  geometries  is, 
of course, the final settlement of the ages-old problem of the parallel postulate.
The consistency established the fact that the parallel postulate is independent
of the other assumptions of Euclidean geometry and proved the
impossibility  of  deducing  the  postulate  as  a  theorem  from  those  other 
assumptions,  for  if  the  parallel  postulate  could  be  so  deduced  there  would 
have  to  be  an  inconsistency  in  the  non-Euclidean  systems. 

But  there  are  consequences  of  the  consistency  of  the  non-Euclidean 
geometries  which  are  much  more  far-reaching  than  the  settlement  of  the 
parallel  postulate  problem.  One  of  the  chief  of  these  is  the  liberation  of 
geometry from its traditional mold. The postulates of geometry become, for
the  mathematician,  mere  hypotheses  whose  physical  truth  or  falsity  need 
not  concern  him;  the  mathematician  may  take  his  postulates  to  suit  his 
pleasure,  so  long  as  they  be  consistent  with  one  another.  A  postulate,  as  the 
word is employed by the mathematician, has nothing to do with “self-evidence”
or “truth.” With the possibility of inventing such purely “artificial”
geometries it became apparent that physical space must be viewed as an
empirical concept derived from our external experiences, and that the postulates
of a geometry designed to describe physical space are simply expressions
of  this  experience,  like  the  laws  of  a  physical  science.  Euclid’s  parallel 
postulate,  for  example,  insofar  as  it  tries  to  interpret  actual  space,  appears  to 
have  the  same  type  of  validity  as  Galileo’s  law  of  falling  bodies;  that  is,  they 
are  both  laws  of  observation  which  are  capable  of  verification  within  the 
limits  of  experimental  error.  This  point  of  view,  that  geometry  when  applied 
to actual space is an experimental science, or a branch of applied mathematics,
is in striking contrast to the Kantian theory of space that dominated
philosophical  thinking  at  the  time  of  the  discovery  of  the  non-Euclidean 
geometries.  The  Kantian  theory  claimed  that  space  is  a  framework  already 
existing  intuitively  in  the  human  mind,  that  the  axioms  and  postulates  of 
Euclidean  geometry  are  a  priori  judgments  imposed  on  the  mind,  and  that 
without  these  axioms  and  postulates  no  consistent  reasoning  about  space 
can be possible. That this viewpoint is untenable was incontestably demonstrated
by the invention of the non-Euclidean geometries.

Indeed,  the  consistency  of  the  non-Euclidean  geometries  not  only 
liberated  geometry  but  had  a  similar  effect  on  mathematics  as  a  whole. 
Mathematics  emerged  as  an  arbitrary  creation  of  the  human  mind,  and 
not  as  something  essentially  dictated  to  us  of  necessity  by  the  world  in  which 
we  live.  The  matter  is  very  neatly  put  in  the  following  words  of  E.  T.  Bell: 

In  precisely  the  same  way  that  a  novelist  invents  characters,  dialogues,  and 
situations  of  which  he  is  both  author  and  master,  the  mathematician  devises  at 
will  the  postulates  upon  which  he  bases  his  mathematical  systems.  Both  the 
novelist  and  the  mathematician  may  be  conditioned  by  their  environments  in  the 
choice and treatment of their material; but neither is compelled by any extrahuman,
eternal necessity to create certain characters or invent certain systems.*

The  invention  of  the  non-Euclidean  geometries,  by  puncturing  a  traditional 
belief  and  breaking  a  centuries-long  habit  of  thought,  dealt  a  severe  blow  to 
the  absolute  truth  viewpoint  of  mathematics.  In  the  words  of  Georg  Cantor, 
“The  essence  of  mathematics  lies  in  its  freedom.” 

*Quoted by permission from E. T. Bell, The Development of Mathematics (2nd ed.;
New York: McGraw-Hill Book Company, Inc., 1945), p. 330.

Since we have a number of geometries of space (the Euclidean and the
two classical non-Euclidean geometries), the question is often asked, “Which
is the true geometry?” This question is, of course, quite meaningless when
geometry  is  considered  a  branch  of  mathematics,  because  all  we  can  say 
about  truth  with  respect  to  a  branch  of  mathematics  is  that  if  the  postulates 
are  true  then  the  theorems  are  true.  If,  on  the  other  hand,  geometry  is 
considered  a  branch  of  physics,  then  the  question  becomes  more  meaningful. 
But  even  here  we  cannot  give  a  simple  and  definite  answer.  When  it  comes 
to  the  applications  of  several  mathematical  theories  to  a  given  physical 
situation,  we  are  interested  in  that  mathematical  theory  which  best  explains, 
or  most  closely  agrees  with,  the  observed  facts  of  the  physical  situation,  and 
which  will  stand  the  kinds  of  tests  customarily  placed  on  hypotheses  in  any 
field  of  scientific  inquiry.  In  the  present  case,  then,  we  are  interested  in  which 
of  the  Euclidean  and  non-Euclidean  systems  of  geometry  most  closely  agrees 
with  the  observed  facts  of  physical  space.  Now  it  is  not  difficult  to  show  that 
all  three  geometries  under  consideration  fit  our  very  limited  portion  of 
physical  space  equally  well,  and  so  it  would  seem  we  must  be  content  with 
an  indeterminate  answer  until  some  crucial  experimental  test  on  a  great 
scale  can  be  devised  to  settle  the  matter.  Such  a  crucial  test  would  appear  to 
be  the  measurement  of  the  sum  of  the  three  angles  of  a  large  physical  triangle. 
To  date  no  deviation,  exceeding  expected  errors  in  measurement,  from 
180°  has  been  found  in  the  sum  of  the  angles  of  any  physical  triangle.  But, 
we  recall,  the  discrepancy  of  this  sum  from  180°  in  the  two  non-Euclidean 
geometries  is  proportional  to  the  area  of  the  triangle,  and  the  area  of  any 
triangle  so  far  measured  may  be  so  small  that  any  existing  discrepancy  is 
swallowed  by  the  allowed  errors  in  measurement.  There  are  even  some 
reasons  for  believing  that  physical  experiments  will  never  be  able  to  resolve 
the  matter  anyway.  In  this  event,  then,  we  would  do  better  to  ask,  not  which 
is  the  true  geometry,  but  which  is  the  most  convenient  geometry,  and  this 
convenience  may  depend  upon  the  application  at  hand.  Certainly,  for 
drafting,  for  terrestrial  surveying,  and  for  the  construction  of  ordinary 
buildings  and  bridges,  Euclidean  geometry  is  probably  the  most  convenient 
simply  because  it  is  the  easiest  with  which  to  work. 
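
The dependence of the discrepancy on area can be illustrated with a short calculation. The following is a minimal sketch, assuming the standard relation that the deviation of the angle sum from 180° equals the area of the triangle divided by the square of a length constant k (the radius of the sphere in the one geometry, the corresponding constant in the other); the function name and the sample figures in the usage note are hypothetical.

```python
import math


def angle_sum_deviation(area: float, k: float) -> float:
    """Deviation of a triangle's angle sum from pi, in radians.

    On a sphere of radius k this is the excess above pi; in the
    Lobachevskian plane with length constant k it is the defect below pi.
    In both cases the deviation is area / k**2.
    """
    return area / (k * k)


def deviation_in_arcseconds(area: float, k: float) -> float:
    """The same deviation expressed in seconds of arc."""
    return math.degrees(angle_sum_deviation(area, k)) * 3600.0
```

As a usage example with purely illustrative numbers, `deviation_in_arcseconds(1.0e4, 6371.0)` gives roughly 51 seconds of arc for a triangle of 10,000 square units drawn on a sphere of radius 6371 units. A triangle one hundredth of that area would deviate by about half a second, which shows how quickly the discrepancy sinks below the errors of measurement when the triangle is small compared with k.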

There  are  physical  studies  where  geometries  other  than  the  Euclidean 
have  been  found  to  be  more  acceptable.  For  example,  Einstein  found  in  his 
study  of  the  general  theory  of  relativity  that  none  of  the  three  geometries 
that  we  have  been  considering  is  adequate,  and  he  adopted  another  kind  of 
non-Euclidean  geometry  that  had  been  first  broached  by  Bernhard  Riemann 
in  his  lecture  of  1854.  Again,  a  recent  study*  of  visual  space  (the  space 
psychologically  observed  by  persons  of  normal  vision)  came  to  the  conclusion 
that  such  space  can  best  be  described  by  Lobachevskian  non-Euclidean 
geometry.  Other  examples  can  be  given. 

*R. K. Luneburg.

Since the discovery of the two non-Euclidean geometries which result
from the hypotheses of the acute and obtuse angles, other non-Euclidean
geometries  have  been  invented.  Any  geometry  whose  postulates  are  not 
equivalent  to  the  postulates  of  Euclidean  geometry  is  rightfully  called  a 
non-Euclidean  geometry.  As  already  intimated  above  in  connection  with 
Einstein  and  his  general  theory  of  relativity,  Riemann  was  the  originator  of 
a  whole  class  of  non-Euclidean  geometries.  These  have  received  intensive 
study  in  recent  times  and  are  usually  referred  to  as  Riemannian  geometries. 
One  of  the  accomplishments  of  the  twentieth  century  was  the  development 
of general non-Riemannian geometries. Another non-Euclidean geometry,
invented through a deliberate application of the postulational method, was
one by Max Dehn in which the postulate of Archimedes is suppressed; such
a geometry is more frequently referred to as a non-Archimedean geometry. The
invention of these new geometries considerably modified former conceptions
of mathematics and led, as we shall see in the following chapters, to a profound
study of the philosophy and foundations of mathematics, and to the
further  development  and  understanding  of  the  axiomatic  method. 


PROBLEMS

3.1.1  Give  examples  illustrating  Proclus’  statement,  “It  is  a  known  fact 
that  some  lines  exist  which  approach  each  other  indefinitely,  but  yet  remain 
non-intersecting.” 

3.1.2  Show  that  Playfair’s  postulate  and  Euclid’s  fifth  postulate  are 
equivalent.  (One  may  use  any  of  Euclid’s  first  28  propositions;  see  Appendix, 
Section A.1.)

3.1.3  Prove  that  each  of  the  following  statements  is  equivalent  to  Playfair’s 
postulate: 

(a)  If  a  straight  line  intersects  one  of  two  parallel  lines,  it  will  intersect 
the  other  also. 

(b)  Straight  lines  which  are  parallel  to  the  same  straight  line  are 
parallel  to  one  another. 

3.1.4  Replacing  Euclid’s  fifth  postulate  by  the  assumption  that  the  sum 
of  the  angles  of  a  triangle  is  always  equal  to  two  right  angles,  establish  the 
following  consequences: 

(a)  An  exterior  angle  of  a  triangle  is  equal  to  the  sum  of  the  two 
opposite  interior  angles. 

(b)  Through  a  given  point  P  there  can  be  drawn  a  line  making,  with 
a given line m, not passing through P, an angle less than any given angle $\alpha$,
however  small. 

(c)  If  PA  and  QB  are  perpendicular  to  the  line  segment  PQ  at  P  and  Q, 
respectively,  and  lie  on  the  same  side  of  PQ,  and  if  PR  is  any  transversal 
through P cutting QB in R, then ∠APR = ∠QRP.

3.1.5 Show that Playfair’s postulate and the statement, “The sum of the
angles  of  a  triangle  is  always  equal  to  two  right  angles,”  are  equivalent. 

3.1.6  Replacing  Euclid’s  fifth  postulate  by  the  assumption,  “If  two  angles 
of  one  triangle  are  equal  to  two  angles  of  another  triangle,  then  the  third 
angles  are  also  equal,”  show  that  the  sum  of  the  angles  of  a  triangle  is  always 
equal  to  two  right  angles. 

3.1.7  Find  the  fallacy  in  the  following  “proof,”  given  by  B.  F.  Thibaut 
(1809),  of  Euclid’s  fifth  postulate:  Let  a  straightedge  be  placed  with  its  edge 
coinciding  with  side  CA  of  triangle  ABC.  Rotate  the  straightedge  successively 
about the three vertices A, B, C, in the direction ABC, so that it coincides
in turn with AB, BC, CA. When the straightedge returns to its original
position  it  must  have  rotated  through  four  right  angles.  But  the  whole 
rotation  is  made  up  of  three  rotations  equal  to  the  exterior  angles  of  the 
triangle.  It  now  follows  that  the  sum  of  the  angles  of  the  triangle  must  be 
equal  to  two  right  angles,  and  from  this  follows  Euclid’s  parallel  postulate. 

3.1.8  Find  the  fallacy  in  the  following  “proof,”  given  by  J.  D.  Gergonne 
(1812),  of  Euclid’s  fifth  postulate:  Let  PA  and  QB,  lying  in  the  same  plane 
and  on  the  same  side  of  PQ,  be  perpendicular  to  PQ.  Then  PA  and  QB  are 
parallel. Let PG be the last ray through P, and lying within angle QPA,
which intersects QB. Produce QB to a point K beyond the point of intersection
of PG with QB, and draw PK. It follows that PG is not the last ray
through  P  which  meets  QB,  and  therefore  all  rays  through  P  and  lying  within 
angle  QPA  must  meet  QB.  Thus  through  P  there  is  only  one  line  parallel 
to  line  QB,  and  Euclid’s  fifth  postulate  follows. 

3.1.9  Find  the  fallacy  in  the  following  “proof,”  given  by  J.  K.  F.  Hauff 
(1819),  of  Euclid’s  fifth  postulate:  Let  AD,  BE,  CF  be  the  altitudes  of  an 
equilateral triangle ABC, and let O be the point of concurrency of these
altitudes. In right triangle ADC, acute angle CAD equals one half acute
angle ACD. Therefore, in right triangle AEO, acute angle OAE equals one
half acute angle AOE. A similar statement holds for each of the six small
right triangles of which AEO is typical. It now follows that the sum of the
angles of triangle ABC is equal to one half the sum of the angles about O,
that  is,  equal  to  two  right  angles.  But  it  is  known  that  the  existence  of  a 
single  triangle  having  the  sum  of  its  angles  equal  to  two  right  angles  is 
enough  to  guarantee  Euclid’s  fifth  postulate. 

3.2.1  The  reductio  ad  absurdum  method  is  often  useful  in  establishing  the 
converse  of  a  known  theorem.  Apply  this  procedure  to  establish  the  converses 
of  the  following  theorems: 

(a) Ceva’s theorem: If D, E, F are three points on the sides BC, CA, AB
of a triangle ABC such that AD, BE, CF are concurrent, then
(BD/DC)(CE/EA)(AF/FB) = 1.

(b)  In  a  triangle  the  greater  angle  lies  opposite  the  greater  side. 
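
A numerical check can make the direct form of Ceva's theorem in part (a) concrete before its converse is attempted. The following is a minimal sketch and not part of the original text; the function names and the sample coordinates are illustrative assumptions, and exact rational arithmetic is used so that the product comes out exactly equal to 1.

```python
from fractions import Fraction


def line_intersection(p1, p2, p3, p4):
    """Intersection of line p1p2 with line p3p4 (assumed non-parallel)."""
    x1, y1 = p1
    x2, y2 = p2
    x3, y3 = p3
    x4, y4 = p4
    den = (x1 - x2) * (y3 - y4) - (y1 - y2) * (x3 - x4)
    a = x1 * y2 - y1 * x2
    b = x3 * y4 - y3 * x4
    return ((a * (x3 - x4) - (x1 - x2) * b) / den,
            (a * (y3 - y4) - (y1 - y2) * b) / den)


def signed_ratio(p, x, q):
    """Signed ratio PX/XQ for a point X on line PQ (X distinct from Q)."""
    if q[0] != x[0]:
        return (x[0] - p[0]) / (q[0] - x[0])
    return (x[1] - p[1]) / (q[1] - x[1])


def ceva_product(A, B, C, P):
    """(BD/DC)(CE/EA)(AF/FB) for the cevians of triangle ABC through P."""
    A, B, C, P = [tuple(Fraction(c) for c in pt) for pt in (A, B, C, P)]
    D = line_intersection(A, P, B, C)  # AD meets BC in D
    E = line_intersection(B, P, C, A)  # BE meets CA in E
    F = line_intersection(C, P, A, B)  # CF meets AB in F
    return signed_ratio(B, D, C) * signed_ratio(C, E, A) * signed_ratio(A, F, B)


# Hypothetical sample triangle and interior point.
product = ceva_product((0, 0), (4, 0), (1, 3), (2, 1))
```

Because the three cevians are constructed through a common point P, the product is forced to equal 1; the converse asked for in the problem runs the implication the other way and still requires a proof.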

3.2.2  (a)  On  the  earth,  on  opposite  sides  of  the  international  date  line,  it 
can  be  both  Monday  and  Tuesday  at  the  same  time.  Discuss  this  in  relation 
to the law of contradiction (which asserts that “today cannot be both Monday
and  Tuesday”),  and  the  law  of  the  excluded  middle  (which  asserts  that  “either 
today  is  Monday  or  today  is  not  Monday”). 

(b) Does the law of the excluded middle hold for the following statements?
(1) All circles are blue. (2) The king of the United States is more
than  six  feet  tall. 

This  exercise  shows  that,  although  the  laws  of  contradiction  and  the 
excluded middle are usually considered universally valid, some sort of limitations
must be made concerning their applicability.

3.2.3  Prove,  by  simple  congruence  theorems  (which  do  not  require  the 
parallel  postulate),  the  following  theorems  about  isosceles  birectangles: 

(a)  The  summit  angles  of  an  isosceles  birectangle  are  equal  to  each 
other. 

(b)  The  line  joining  the  midpoints  of  the  base  and  summit  of  an  isosceles 
birectangle  is  perpendicular  to  both  the  base  and  the  summit. 

(c)  If  perpendiculars  are  drawn  from  the  extremities  of  the  base  of  a 
triangle  upon  the  line  passing  through  the  midpoints  of  the  two  sides,  an 
isosceles  birectangle  is  formed. 

(d)  The  line  joining  the  midpoints  of  the  equal  sides  of  an  isosceles 
birectangle  is  perpendicular  to  the  line  joining  the  midpoints  of  the  base 
and  summit. 

3.2.4  The  hypothesis  of  the  acute  angle  assumes  that  the  equal  summit  angles 
of  an  isosceles  birectangle  are  acute,  or  that  the  fourth  angle  of  a  trirectangle 
(a  quadrilateral  containing  three  right  angles)  is  acute.  In  the  following  we 
shall  assume  the  hypothesis  of  the  acute  angle: 

(a)  Let  ABC  be  any  right  triangle,  and  let  M  be  the  midpoint  of  the 
hypotenuse  AB.  At  A  construct  angle  BAD  =  angle  ABC.  From  M  draw  MP 
perpendicular  to  CB.  On  AD  mark  off  AQ  =  PB ,  and  draw  MQ.  Prove 
triangles AQM and BPM congruent, thus showing that angle AQM is a right
angle and that points Q, M, P are collinear. Then ACPQ is a trirectangle
with acute angle at A. Now show that, under the hypothesis of the acute angle,
the  sum  of  the  angles  of  any  right  triangle  is  less  than  two  right  angles. 

(b)  Let  angle  A  of  triangle  ABC  be  not  smaller  than  either  angle  B  or 
angle  C.  Draw  the  altitude  through  A,  and  show,  by  part  (a),  that  under  the 
hypothesis of the acute angle, the sum of the angles of any triangle is less than two
right  angles.  The  difference  between  two  right  angles  and  the  sum  of  the  angles 
of  a  triangle  is  known  as  the  defect  of  the  triangle. 

(c)  Consider  two  triangles,  ABC  and  A'B'C',  in  which  corresponding 
angles  are  equal.  If  A'B'  =  AB,  then  these  triangles  are  congruent.  Suppose 
A'B'  <  AB.  On  AB  mark  off  AD  =  A'B',  and  on  AC  mark  off  AE  =  A'C'. 
Then triangles ADE and A'B'C' are congruent. Show that E cannot fall on
C, since then angle BCA would be greater than angle DEA. Show also that
E cannot fall on AC produced, since then DE would cut BC in a point F
and the sum of the angles of triangle FCE would exceed two right angles.
Therefore E lies between A and C and BCED is a convex quadrilateral.
Show  that  the  sum  of  the  angles  of  this  quadrilateral  is  equal  to  four  right 
angles.  But  this  is  impossible  under  the  hypothesis  of  the  acute  angle.  It  thus 
follows that we cannot have A'B' < AB and that, under the hypothesis of the
acute angle, two triangles are congruent if the three angles of one are equal to the three
angles  of  the  other.  In  other  words,  in  the  geometry  resulting  from  the  acute 
angle  hypothesis  similar  figures  of  different  sizes  do  not  exist. 

(d)  A  line  segment  joining  a  vertex  of  a  triangle  to  a  point  on  the 
opposite side is called a cevian. A cevian divides a triangle into two subtriangles,
each of which may be similarly divided, and so on. Show that if
a  triangle  is  partitioned  by  cevians  into  a  finite  number  of  subtriangles,  the 
defect  of  the  original  triangle  is  equal  to  the  sum  of  the  defects  of  the  triangles 
in  the  partition. 

3.3.1  Show  that  a  trirectangle  can  be  regarded  as  half  of  an  isosceles 
birectangle. 

3.3.2  A  spherical  degree  for  a  given  sphere  is  defined  to  be  the  spherical  area 
which is equivalent to (1/720)th of the entire surface of the sphere. The
spherical  excess  of  a  spherical  triangle  is  defined  as  the  excess,  measured  in 
degrees  of  angle,  of  the  sum  of  the  angles  of  the  triangle  above  180°. 

(a) Show that the area of a lune whose angle is n° is equal to 2n
spherical  degrees. 

(b)  Show  that  the  area  of  a  spherical  triangle,  in  spherical  degrees,  is 
equal  to  the  spherical  excess  of  the  triangle. 

(c)  Show  that  the  area  A  of  a  spherical  triangle  of  spherical  excess  E° 
is  given  by 

$$A = \frac{\pi r^2 E^\circ}{180^\circ},$$

where  r  is  the  radius  of  the  sphere.  This  shows  that,  for  a  given  sphere,  the 
area  of  a  spherical  triangle  is  proportional  to  its  spherical  excess. 
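
The formula of part (c) can be checked against a triangle whose area is known in advance. The sketch below is an illustration only, with hypothetical function names: it uses the octant triangle (three right angles, so a spherical excess of 90°), which covers one eighth of the sphere, and the full lune of 360°, which covers the whole sphere.

```python
import math


def lune_area_spherical_degrees(angle_deg):
    """Area of a lune of angle n degrees, in spherical degrees (part a)."""
    return 2.0 * angle_deg


def triangle_area(excess_deg, r):
    """Area of a spherical triangle from its spherical excess (part c)."""
    return math.pi * r ** 2 * excess_deg / 180.0


# Octant triangle: three right angles, so the excess is 270 - 180 = 90 degrees.
r = 2.0  # assumed sample radius
octant_excess = 3 * 90.0 - 180.0
octant_area = triangle_area(octant_excess, r)
sphere_area = 4.0 * math.pi * r ** 2
```

The octant's excess of 90 spherical degrees is 90/720 of the sphere, which agrees with the one-eighth share obtained from the area formula.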

3.3.3  Indicate  how  one  might  set  up  a  one-to-one  correspondence  on  a 
sphere  between  spherical  angles  and  great  circle  arcs,  thus  showing  that  in 
the  geometry  on  a  sphere  lengths,  as  well  as  angles,  are  absolute. 

3.3.4 Fill in the details of the following proof of Legendre’s first theorem:
The sum of the three angles of a triangle cannot be greater than two right
angles. Show that the proof assumes the infinitude of the straight line.

Suppose that the sum of the angles of triangle ABC is 180° + δ, and
that angle CAB is not greater than either of the other angles. Join A to D,
the midpoint of BC, and produce AD its own length to E. Show that triangles
BDA and CDE are congruent; hence that the sum of the angles of triangle
AEC is also equal to 180° + δ. One of the angles CAE and CEA is not greater
than (1/2)∠CAB. Apply the same process to triangle AEC, obtaining a
third triangle whose angle-sum is 180° + δ and one of whose angles is not
greater than (1/2^2)∠CAB. By applying the construction n times, a triangle
is reached whose angle-sum is 180° + δ and one of whose angles is not greater
than (1/2^n)∠CAB. By the postulate of Archimedes there exists an integer
k such that kδ > ∠CAB. Choose n so large that 2^n > k. Then δ > (1/2^n)∠CAB,
and the sum of two of the angles of the last triangle must be greater
than 180°. But this conclusion contradicts Proposition I 17.

3.3.5  In  one  effort  to  eliminate  the  hypothesis  of  the  acute  angle,  Legendre 
tried  to  obtain,  under  this  hypothesis,  a  triangle  containing  a  given  triangle 
at  least  twice.  He  proceeded  as  follows.  Let  ABC  be  any  triangle  such  that 
angle  A  is  not  greater  than  either  of  the  other  two  angles.  Construct  on  side 
BC a triangle DCB congruent to triangle ABC, with angle DCB equal to
angle  B  and  angle  DBC  equal  to  angle  C.  Through  D  draw  any  line  cutting 
AB  and  AC  produced  in  E  and  F,  respectively.  Then  triangle  AEF  contains 
triangle  ABC  at  least  twice. 

Show  that  this  construction  assumes  that  through  a  point  within  a  given 
angle  less  than  60°  there  can  always  be  drawn  a  straight  line  intersecting 
both  sides  of  the  angle. 

3.3.6  Assuming  Legendre’s  first  theorem  (see  Problem  3.3.4),  prove  the 
following  sequence  of  theorems  credited  to  Legendre: 

(a)  If  the  sum  of  the  angles  of  a  triangle  is  equal  to  two  right  angles, 
then  the  same  is  true  of  any  triangle  obtained  from  the  given  triangle  by 
drawing  a  cevian  [see  Problem  3.2.4  (d)]  through  one  of  its  vertices. 

(b)  If  there  exists  a  triangle  with  the  sum  of  its  angles  equal  to  two  right 
angles  then  one  can  construct  an  isosceles  right  triangle  having  the  sum  of 
its  angles  equal  to  two  right  angles  and  its  legs  greater  in  length  than  any 
given  line  segment. 

(c) Legendre’s second theorem: If there exists a single triangle having the
sum  of  its  angles  equal  to  two  right  angles,  then  the  sum  of  the  angles  of 
every  triangle  will  be  equal  to  two  right  angles. 

(d)  If  there  exists  a  single  triangle  having  the  sum  of  its  angles  less  than 
two  right  angles,  then  the  sum  of  the  angles  of  every  triangle  is  less  than 
two  right  angles. 

3.4.1  (a)  If  ABC  is  a  spherical  right  triangle,  right-angled  at  C,  show  that 
sin  A  =  sin  a/sin  c. 

(b) If ABC is any spherical triangle, show that sin a : sin b : sin c =
sin A : sin B : sin C.

This  is  the  law  of  sines  for  spherical  triangles. 

(c)  If  ABC  is  any  spherical  triangle,  show  that 

○(a) : ○(b) : ○(c) = sin A : sin B : sin C,

where ○(r) denotes the circumference of a circle, on the sphere, whose polar
distance  is  r. 

3.4.2  Prove,  by  using  Dedekind’s  postulate  of  continuity  (see  Problem 
2.4.2),  the  following  theorem  in  Lobachevskian  geometry:  If  m  is  a  line  and 
P  a  point  not  on  m,  then  there  are  two  lines  through  P  which  do  not  intersect 
m, which make equal acute angles α with the perpendicular PQ from P to
m,  and  which  are  such  that  any  line  through  P  lying  within  the  angle  formed 
by  the  two  lines  and  containing  the  perpendicular  PQ  intersects  m,  while 
any  other  line  through  P  does  not  intersect  m. 

3.4.3 Given α = 2 Arc tan e^(−h), solve for h in terms of α.
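
A proposed solution to Problem 3.4.3 can be tested numerically by a round trip. The sketch below is an illustration under stated assumptions: the function names are hypothetical, and the candidate inverse h = −ln tan(α/2) is one form the answer may take, which the reader should still derive by hand.

```python
import math


def angle_from_distance(h):
    """alpha = 2 arctan(e^(-h)), the relation given in the problem."""
    return 2.0 * math.atan(math.exp(-h))


def distance_from_angle(alpha):
    """Candidate inverse: h = -ln(tan(alpha / 2)), for 0 < alpha < pi."""
    return -math.log(math.tan(alpha / 2.0))


# Round trip h -> alpha -> h for a few assumed sample distances.
samples = [0.5, 1.0, 3.0]
recovered = [distance_from_angle(angle_from_distance(h)) for h in samples]
```

At h = 0 the angle is a right angle, and the angle decreases toward zero as h grows, which matches the behavior expected of an angle of parallelism.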

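3.4.4 (for students who have studied calculus) Define, for any complex number z,

$$e^z = 1 + z + \frac{z^2}{2!} + \cdots + \frac{z^{n-1}}{(n-1)!} + \cdots,$$

$$\sin z = z - \frac{z^3}{3!} + \cdots + (-1)^{n+1}\frac{z^{2n-1}}{(2n-1)!} + \cdots,$$

$$\cos z = 1 - \frac{z^2}{2!} + \cdots + (-1)^{n+1}\frac{z^{2n-2}}{(2n-2)!} + \cdots,$$

$$\sinh z = \frac{e^z - e^{-z}}{2}, \qquad \cosh z = \frac{e^z + e^{-z}}{2}.$$

It can be shown that each of the series converges for all complex numbers z.

(a) Show that, if x is real,

$$e^{xi} = \cos x + i \sin x, \qquad e^{-xi} = \cos x - i \sin x,$$

whence

$$\sin x = \frac{e^{xi} - e^{-xi}}{2i}, \qquad \cos x = \frac{e^{xi} + e^{-xi}}{2}.$$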

(b) In Lobachevskian geometry the parts a, c, A of a right triangle
ABC right-angled at C can be shown to be related by

$$\sin A = \frac{\sinh a}{\sinh c}.$$

Replace a by a/i and c by c/i, and obtain the corresponding formula [see
Problem 3.4.1(a)]

$$\sin A = \frac{\sin a}{\sin c}$$

of spherical geometry.
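
Both parts of Problem 3.4.4 lend themselves to a quick numerical check with complex arithmetic. The sketch below is illustrative only; the function names and the sample values are assumptions, and a numerical agreement is of course not a proof.

```python
import cmath
import math


def euler_gap(x):
    """|e^(xi) - (cos x + i sin x)| for real x; expected to be near zero."""
    return abs(cmath.exp(1j * x) - complex(math.cos(x), math.sin(x)))


def ratio_after_substitution(a, c):
    """sinh(a/i) / sinh(c/i), the hyperbolic ratio after the substitution."""
    return cmath.sinh(a / 1j) / cmath.sinh(c / 1j)


# Assumed sample side lengths of a right triangle, in radians of arc.
a, c = 0.7, 1.2
substituted = ratio_after_substitution(a, c)
spherical = math.sin(a) / math.sin(c)
```

The factor introduced by the imaginary unit appears in both numerator and denominator and cancels, which is why the substituted ratio is real and equals sin a / sin c.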


3.4.5  Prove  that,  in  plane  Riemannian  non-Euclidean  geometry,  all  the 
perpendiculars erected on the same side of a given straight line m are concurrent
in a point O, the lengths along these perpendiculars from O to the
line  m  are  all  equal  to  one  another,  and  this  common  length  is  independent 
of  which  straight  line  in  the  plane  is  chosen  for  m. 

3.5.1 Verify, in spherical geometry, the following theorems of plane
Riemannian  non-Euclidean  geometry: 

(a)  All  the  perpendiculars  erected  on  the  same  side  of  a  given  straight 
line m are concurrent in a point O, the lengths along these perpendiculars
from O to the line m are all equal to one another, and this common length
(call  it  q)  is  independent  of  which  straight  line  in  the  plane  is  chosen  for  m. 

(b) If A, B, P are any three points on the line m, then AP : AB =
∠AOP : ∠AOB.

(c) All straight lines are finite and of the same constant length 4q.

(d)  Two  triangles  are  congruent  if  the  three  angles  of  one  are  equal 
to  the  three  angles  of  the  other. 

3.5.2 (for students who have studied calculus) Consider the sections of a surface
S made by planes containing the normal to S at a point P on S. Of these
sections there is one having a maximum curvature k at P, and one having
a minimum curvature k' at P. These two sections of maximum and minimum
curvature  are  generally  at  right  angles  to  each  other,  and  their  curvatures 
at  P  are  called  the  principal  curvatures  of  S  at  P.  The  product  K  =  kk'  is 
called  the  (Gaussian,  or  total)  curvature  of  the  surface  S  at  P.  If  the  two 
principal  curvatures  are  of  the  same  sense,  then  K  is  positive;  if  the  two 
principal  curvatures  are  of  opposite  senses,  then  K  is  negative;  if  one 
principal  curvature  is  zero,  then  K  is  zero.  Gauss,  who  essentially  created 
the  differential  geometry  of  surfaces,  discovered  the  remarkable  theorem  that 
if  a  surface  is  bent  (without  stretching,  creasing,  or  tearing)  the  curvature  of  the 
surface  at  each  point  remains  unaltered. 

(a)  Is  there  a  quadric  surface  whose  curvature  is  everywhere  positive? 
everywhere  negative?  everywhere  zero?  in  some  places  positive  and  in  others 
negative? 

(b)  If  a  surface  can  be  bent  so  as  to  coincide  with  another  surface, 
then  the  two  surfaces  are  said  to  be  applicable  to  each  other.  Show  that  a 
pair  of  applicable  surfaces  have  their  points  in  one-to-one  correspondence 
such  that  at  pairs  of  corresponding  points  the  curvatures  of  the  two  surfaces 
are  equal. 

(c)  Show  that  when  one  surface  is  bent  into  another  surface,  the 
geodesics  of  the  first  surface  go  into  geodesics  of  the  second  surface. 

(d)  Show  that  a  sphere  of  radius  r  has  constant  positive  curvature 
equal to 1/r².

(e)  Show  that  a  plane  has  constant  zero  curvature. 

(f) Show that a cylindrical surface has constant zero curvature. Is a
cylindrical surface applicable to a plane?

(g) Show that if a surface is applicable upon itself in all positions, its
curvature must be constant.

(h) Show that the only surfaces upon which free mobility of figures is
possible are those of constant curvature.

(i)  Show  that  a  sphere  is  not  applicable  to  a  plane.  (This  is  why,  in 
terrestrial  map  making,  some  sort  of  distortion  in  the  map  is  necessary.) 

3.5.3 (for students who have studied calculus) The hyperbolic sine and hyperbolic
cosine functions may be defined (see Problem 3.4.4) by sinh x = (e^x − e^(−x))/2
and cosh x = (e^x + e^(−x))/2.

(a) Show that cosh² x − sinh² x = 1.

(b) Show that d(cosh u)/dx = sinh u du/dx, d(sinh u)/dx = cosh u du/dx.

3.5.4 (for students who have studied calculus) The graph of y = k cosh (x/k) is
the catenary, the form assumed by a perfectly flexible inextensible chain of
uniform density hanging from two supports not in the same vertical line.

[Figure 21]

Let this catenary (see Figure 21) cut the y axis in A; let P be any point on
the curve, and let F be the foot of the ordinate through P; let the tangent
to the curve at P cut the x axis in T, and let Q be the foot of the perpendicular
from F on PT.

(a)  With  simple  calculus,  show  that  QF  is  constant  and  equal  to  k. 

(b)  With  the  aid  of  integral  calculus,  show  that  QP  is  equal  to  the 
length  of  the  arc  AP. 

(c)  Show  that  if  a  string  AP  is  unwound  from  the  catenary,  the  tracing 
end  A  will  describe  a  curve  AQ  having  the  property  that  the  length  of  the 
tangent  QF  is  constant  and  equal  to  k.  In  other  words,  the  locus  of  Q,  which 
is an involute of the catenary, is a tractrix.

(d) It can be shown that for a surface of revolution the principal curvatures
(see Problem 3.5.2) at a point Q on the surface are the curvature of
the meridian through Q and the curvature of the section through Q which
is normal to the meridian through Q. If the normal to the surface at Q
meets the axis of revolution of the surface in T, then the latter curvature
is known to be equal to 1/QT. Show that the principal curvatures at Q of
the tractoid obtained by revolving the tractrix of part (c) about the x axis
are given by 1/QP and 1/QT.

(e) Show that the curvature (see Problem 3.5.2) of the tractoid of
part (d) is constant, and everywhere equal to −1/k².
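
The claims of Problem 3.5.4 can be explored numerically before they are proved. The following sketch is an illustration, not the book's method: the function names are hypothetical, the point Q is found by projecting F onto the tangent line at P, and the arc length is approximated with a simple midpoint rule.

```python
import math


def catenary_lengths(k, x0):
    """Return (QF, QP, QT) for the catenary y = k cosh(x/k) at abscissa x0 > 0."""
    y0 = k * math.cosh(x0 / k)          # P = (x0, y0), F = (x0, 0)
    m = math.sinh(x0 / k)               # slope of the tangent at P
    norm = math.hypot(1.0, m)
    ux, uy = 1.0 / norm, m / norm       # unit vector along the tangent PT
    t = -y0 * uy                        # signed projection of PF = (0, -y0) on the tangent
    qx, qy = x0 + t * ux, y0 + t * uy   # Q, foot of the perpendicular from F
    tx = x0 - y0 / m                    # T, where the tangent meets the x axis
    qf = math.hypot(qx - x0, qy)
    qp = math.hypot(qx - x0, qy - y0)
    qt = math.hypot(qx - tx, qy)
    return qf, qp, qt


def arc_length(k, x0, n=20000):
    """Midpoint-rule arc length of the catenary from A (x = 0) to P (x = x0)."""
    h = x0 / n
    return sum(math.hypot(1.0, math.sinh((i + 0.5) * h / k)) * h for i in range(n))


k, x0 = 2.0, 1.5  # assumed sample values
qf, qp, qt = catenary_lengths(k, x0)
```

In this sketch QF comes out equal to k, QP matches the arc length AP, and the product QP · QT equals k², which is consistent with a curvature of −1/k² in part (e) once the opposite senses of the two principal curvatures are taken into account.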

3.5.5  (for  students  who  have  studied  calculus)  Show  that  the  tractrix  is  an 
orthogonal  trajectory  of  a  family  of  circles  of  constant  radius  k  with  centers 
lying  on  a  straight  line. 

3.5.6 (for students who have studied calculus) Show that the geometry of
geodesics on a surface of constant zero curvature (for example, on a cylindrical
surface) is Euclidean. (See Problem 3.5.2.)

3.5.7 Take a fixed circle, Σ, in the Euclidean plane, and interpret the
Lobachevskian plane as the interior of Σ, a “point” of the Lobachevskian
plane as a Euclidean point within Σ, and a “line” of the Lobachevskian
plane as that part of a Euclidean line which is contained within Σ. Verify,
in this model, the following statements:

(a)  Two  “points”  determine  one  and  only  one  “line.” 

(b)  Two  distinct  “lines”  intersect  in  at  most  one  “point.” 

(c) Through a “point” P not on a “line” m can be passed infinitely
many “lines” not meeting “line” m.

(d) Let the Euclidean line determined by the two “points” P and Q
intersect Σ in S and T, in the order S, P, Q, T. Then we interpret the
Lobachevskian “distance” from P to Q as log [(QS)(PT)/(PS)(QT)]. If
P, Q, R are three “points” on a “line,” show that

“distance” PQ + “distance” QR = “distance” PR.

(e) Let “point” P be fixed and let “point” Q move along a fixed “line”
through P toward T. Show that “distance” PQ → ∞.

This  model  was  devised  by  Felix  Klein  (1849-1929).  With  the  above 
interpretations,  and  with  a  suitable  interpretation  of  “angle”  between  two 
“lines”  it  can  be  shown  that  all  the  assumptions  necessary  for  Euclidean 
geometry,  except  the  parallel  postulate,  are  true  propositions  in  the  geometry 
of  the  model.  We  have  seen,  in  part  (c),  that  the  Euclidean  parallel  postulate 
is  not  such  a  proposition  but  that  the  Lobachevskian  parallel  postulate  holds 
instead.  The  model  thus  proves  that  the  Euclidean  parallel  postulate  cannot 
be  deduced  from  the  other  assumptions  of  Euclidean  geometry,  for  if  it  were 
implied  by  the  other  assumptions  it  would  have  to  be  a  true  proposition 
in  the  geometry  of  the  model. 
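
The distance of part (d) is easy to experiment with on a computer. The sketch below is an illustration under assumptions: the function name is hypothetical, the circle is taken as the unit circle centered at the origin, and the sample points are arbitrary. It checks additivity along a “line” and the unbounded growth of part (e).

```python
import math


def klein_distance(p, q, radius=1.0):
    """log[(QS)(PT) / ((PS)(QT))] for distinct points p, q inside the circle."""
    px, py = p
    dx, dy = q[0] - px, q[1] - py
    # Solve |p + t d|^2 = radius^2 for the chord endpoints S and T.
    a = dx * dx + dy * dy
    b = 2.0 * (px * dx + py * dy)
    c = px * px + py * py - radius ** 2
    root = math.sqrt(b * b - 4.0 * a * c)
    ts = (-b - root) / (2.0 * a)   # parameter of S (negative: S lies on P's side)
    tt = (-b + root) / (2.0 * a)   # parameter of T (greater than 1: T lies beyond Q)
    # Along the line, lengths are proportional to parameter differences,
    # and the common factor cancels in the ratio.
    ps, pt = 0.0 - ts, tt - 0.0
    qs, qt = 1.0 - ts, tt - 1.0
    return math.log((qs * pt) / (ps * qt))


# Three collinear sample "points" in the order P, Q, R.
P, Q, R = (-0.5, 0.1), (0.0, 0.2), (0.5, 0.3)
left = klein_distance(P, Q) + klein_distance(Q, R)
right = klein_distance(P, R)
```

Additivity holds because the cross ratios multiply along a common chord, and the logarithm turns that product into a sum.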

3.5.8  Consider  the  following  set  of  postulates  about  certain  objects  called 
“dabas”  and  certain  collections  of  dabas  called  “abas”: 

P1: Every aba is a collection of dabas.

P2:  There  exist  at  least  two  dabas. 

P3:  If  p  and  q  are  two  dabas,  then  there  exists  one  and  only  one  aba 
containing  both  p  and  q. 

P4:  If  L  is  an  aba,  then  there  exists  a  daba  not  in  L. 

P5:  If  L  is  an  aba,  and  p  is  a  daba  not  in  L,  then  there  exists  one  and 
only  one  aba  containing  p  and  not  containing  any  daba  that  is  in  L. 

(a)  Devise  a  model  (or  interpretation)  of  the  postulate  set  to  show  that 
P3  cannot  be  deduced  from  the  remaining  postulates  of  the  set. 

(b)  Devise  a  model  (or  interpretation)  of  the  postulate  set  to  show  that 
P5  cannot  be  deduced  from  the  remaining  postulates  of  the  set. 

(c)  Restate  the  postulates  by  interpreting  “aba”  as  “straight  line”  and 
“daba”  as  “point.”  Note  that  P5  is  now  Playfair’s  postulate. 
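
For finite interpretations, the five postulates can be checked mechanically. The sketch below is illustrative only: the function name and the three small models are assumptions chosen for the example, and they are merely candidate answers to parts (a) and (b), not the only possible ones.

```python
from itertools import combinations


def check_postulates(dabas, abas):
    """Return which of P1-P5 hold in a finite model (abas are sets of dabas)."""
    dabas = frozenset(dabas)
    abas = [frozenset(L) for L in abas]
    return {
        "P1": all(L <= dabas for L in abas),
        "P2": len(dabas) >= 2,
        # Exactly one aba contains each pair of distinct dabas.
        "P3": all(sum(1 for L in abas if p in L and q in L) == 1
                  for p, q in combinations(dabas, 2)),
        "P4": all(len(dabas - L) > 0 for L in abas),
        # Exactly one aba through p shares no daba with L.
        "P5": all(sum(1 for M in abas if p in M and not (M & L)) == 1
                  for L in abas for p in dabas - L),
    }


# Four dabas, every pair an aba: all five postulates hold.
four_point = check_postulates({1, 2, 3, 4}, list(combinations({1, 2, 3, 4}, 2)))
# Three dabas, every pair an aba: only P5 fails.
three_point = check_postulates({1, 2, 3}, [{1, 2}, {1, 3}, {2, 3}])
# Two dabas, each forming an aba by itself: only P3 fails.
two_point = check_postulates({1, 2}, [{1}, {2}])
```

A model in which one postulate fails while the others hold shows that the failing postulate cannot be deduced from the rest, which is the same argument the Klein model of Problem 3.5.7 makes for the parallel postulate.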

3.5.9  Legend  has  it  that  in  an  effort  to  determine  whether  physical  space 
is  Euclidean  or  non-Euclidean,  Gauss  measured  the  sum  of  the  angles  of  a 
large  triangle  whose  vertices  were  three  mountain  peaks.  He  found  no 
deviation from 180°, beyond the expected error of measurement, and thus
was  unable  to  conclude  anything  except  that  perhaps  the  triangle  was  too 
small.  Assuming  such  a  test  to  be  workable  (in  this  connection  see  Problem 
3.5.10),  perhaps  a  crucial  experiment  could  be  devised  in  connection  with 
the  measurement  of  the  parallax  of  stars. 


In Figure 22 let S represent a star and E₁ and E₂ two positions of the
earth at opposite ends of a diameter of its orbit. Let C represent the sun,
and suppose CS is perpendicular to E₁E₂. Angle E₁SC is then called the
parallax of star S. There are two methods of determining this parallax, the
direct method and the differential method. In the direct method one measures
angle SE₁C and then calculates the parallax angle. In the differential method
one measures angle S'E₁S, where S' is a “neighboring” star whose distance
is known to be so much greater than that of S that it is taken as infinite. Set

δ = 90° − ∠SE₁C − ∠S'E₁S.

(a) Assuming Euclidean geometry, show that δ = 0.

(b) Assuming Lobachevskian geometry, show that δ = 90° − Π(R)
> 0, where R is the radius of the earth’s orbit.

(c) Assuming either geometry, show that ∠E₁SC > δ, and that it then
follows that if a positive lower bound can be found for δ, the geometry of
physical space would be Lobachevskian rather than Euclidean.

3.5.10  Because  of  the  apparently  inextricable  entanglement  of  space  and 
matter  it  may  be  impossible  to  determine  by  astronomical  methods  whether 
physical  space  is  Euclidean  or  non-Euclidean.  Since  all  measurements  involve 
both  physical  and  geometrical  assumptions,  an  observed  result  can  be 
explained  in  many  different  ways  by  merely  making  suitable  compensatory 
changes  in  our  assumed  qualities  of  space  and  matter.  For  example,  it  is 
quite  possible  that  a  discrepancy  observed  in  the  angle-sum  of  a  triangle 
could  be  explained  by  preserving  the  assumptions  of  Euclidean  geometry 
but  at  the  same  time  modifying  some  physical  law,  such  as  some  law  of 
optics.  And  again,  the  absence  of  any  such  discrepancy  might  be  compatible 
with  the  assumptions  of  a  non-Euclidean  geometry,  together  with  some 
suitable  adjustments  in  our  assumptions  about  matter.  On  these  grounds 
Henri Poincaré maintained the impropriety of asking which geometry is the
true one. To clarify this viewpoint, Poincaré devised an imaginary universe
Σ occupying the interior of a sphere of radius R in which he assumed the
following physical laws to hold:

(1) At any point P of Σ the absolute temperature T is given by
T = k(R² − r²), where r is the distance of P from the center of Σ and k
is a constant.

(2)  The  linear  dimensions  of  a  material  body  vary  directly  with  the 
absolute  temperature  of  the  body’s  locality. 

(3) All material bodies in Σ immediately assume the temperatures of
their localities.

(a) Show that it is possible for an inhabitant of Σ to be quite unaware
of the above three physical laws holding in his universe.

(b) Show that an inhabitant of Σ would feel that his universe is infinite
in  extent  on  the  grounds  that  he  would  never  reach  a  boundary  after  taking 
a  finite  number  N  of  steps,  no  matter  how  large  N  may  be  chosen. 

(c) Show that geodesics in Σ are curves bending toward the center
of Σ. As a matter of fact, it can be shown that the geodesic through two
points A and B of Σ is the arc of a circle or straight line through A and B
which cuts the bounding sphere orthogonally.

(d) Let us impose one further physical law on the universe Σ by
supposing that light travels along the geodesics of Σ. This condition can be
physically realized by filling Σ with a gas having the proper index of refraction
at each point of Σ. Show, now, that the geodesics of Σ will “look
straight” to an inhabitant of Σ.

(e) Show that in the geometry of geodesics in Σ the Lobachevskian
parallel postulate holds, so that an inhabitant of Σ would believe that he
lives  in  a  non-Euclidean  world.  Here  we  have  a  piece  of  ordinary,  and 
supposedly  Euclidean,  space,  which,  because  of  different  physical  laws, 
appears  to  be  non-Euclidean. 
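
Part (b) of Problem 3.5.10 can be made tangible with a small simulation. The sketch below rests on stated assumptions that go beyond the text: the inhabitant walks radially outward from the center, each stride is measured in the outside observer's Euclidean units, and by law (2) its length is the stride at the center scaled by T/T₀ = 1 − r²/R². The function name and the sample values are hypothetical.

```python
def walk_outward(R, stride_at_center, n_steps):
    """Distance from the center after n radial steps in Poincare's universe."""
    r = 0.0
    for _ in range(n_steps):
        # Law (2): the stride shrinks in proportion to the local temperature,
        # T / T_center = (R^2 - r^2) / R^2.
        r += stride_at_center * (1.0 - (r / R) ** 2)
    return r


R = 1.0
after_10 = walk_outward(R, 0.1, 10)
after_50 = walk_outward(R, 0.1, 50)
after_100 = walk_outward(R, 0.1, 100)
```

Each step closes only a fixed fraction of the remaining gap to the boundary, so the walker keeps advancing without arriving. For very large step counts floating-point rounding will eventually blur the remaining gap, so the sketch is meant for modest values of n.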


CHAPTER FOUR

HILBERT’S  “GRUNDLAGEN” 


4.1 The Work of Pasch, Peano, and Pieri

The  modern  postulational  method  in  mathematics,  with  the  present-day 
deliberate  drive  toward  more  and  ever  more  generalization  and  abstraction 
in  the  subject,  can  be  traced  directly  to  two  sources  of  approximately 
simultaneous  origin — the  bold  creation  by  Lobachevsky  and  Bolyai  of 
non-Euclidean geometry and the penetrating discovery by British mathematicians
of abstract algebraic structure. We have discussed the first of these
sources  in  the  preceding  chapter  and  we  shall  deal  with  the  second  one  in  the 
succeeding  chapter.  In  the  present  chapter  we  wish  to  consider  a  bit  further 
some  of  the  historical  consequences  of  the  first  of  the  two  fountainheads. 

After  the  discovery  of  non-Euclidean  geometry,  a  need  was  felt  for  a 
truly  satisfactory  postulational  treatment  of  Euclidean  geometry.  All  hidden, 
or  tacit,  assumptions  had  to  be  ferreted  out,  and  a  logically  acceptable  set  of 
underlying  postulates  for  the  subject  had  to  be  clearly  and  unequivocally 
put  forth.  Such  an  organization  of  Euclidean  geometry  was  first  accomplished 
in  1882  by  the  German  mathematician  Moritz  Pasch. 

In  his  treatment  of  Euclidean  geometry,  Pasch  recognized  the  important 
distinction between explicit and implicit definition. Most people are familiar
with the concept of explicit definition, inasmuch as this is the type of definition
most frequently employed. In an explicit definition a new term is
expressed  by  means  of  terms  which  are  already  accepted  in  the  vocabulary. 
In  a  technical  sense,  then,  a  new  term  introduced  in  such  a  manner  serves 
merely  as  an  abbreviation  for  a  complex  combination  of  terms  already 
present.  Thus  a  new  term  introduced  by  explicit  definition  is  really  arbitrary, 
though  convenient,  and  may  be  entirely  dispensed  with,  although  then  the 
discourse in which the vocabulary is to be employed would immediately
increase  in  complexity. 

Implicit  definition,  on  the  other  hand,  is  relatively  unfamiliar  to  most 
people,  though  such  a  notion  is  indispensable  in  logical  theory.  The  necessity 
for  implicit  definition  is  due  to  the  fact  that  it  is  impossible  to  define  all 
terms  explicitly  if  we  wish  to  avoid  circularity.  It  is  impossible,  for  example, 
to define word A in terms of word B, then word B in terms of word C, and
so  on  indefinitely,  for  such  a  procedure  would  imply  an  infinite  number 
of  words  in  the  vocabulary.  Of  course  the  dictionary  makes  an  attempt  to 
define  all  words  explicitly  through  the  admitted  use  of  circularity,  but  it  is 
hoped  that  the  person  using  the  dictionary  has  developed  an  adequate 
vocabulary  so  that  the  words  in  terms  of  which  some  unknown  word  is 
defined  are  already  familiar  to  him. 

Every  person  is  aware  of  his  own  attempt  to  introduce  new  words  into 
his  vocabulary  by  observing  how  these  words  are  used  by  others.  This  idea 
of  defining  a  word  through  the  medium  of  the  context  in  which  it  occurs  is  the 
basic  idea  of  implicit  definition.  In  a  logical  discourse,  since  not  all  technical 
terms  can  be  explicitly  defined,  there  must  be  some  whose  meanings  can  be 
realized  only  by  observing  the  context  in  which  they  are  employed.  A  logical 
discourse,  in  other  words,  must  accept  a  relatively  small  number  of  primitive 
technical  terms  which  can  be  used  for  explicitly  defining  all  the  other 
technical  terms  which  occur  in  the  discourse,  these  primitive  terms  of  the 
discourse  receiving  no  definitions  except  those  given  to  them  implicitly  by 
their  presence  in  the  adopted  postulates  of  the  discourse. 
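
The role of explicit definitions as mere abbreviations can be made concrete with a short Python sketch. The toy vocabulary below is purely hypothetical (the names PRIMITIVES, DEFINITIONS, and expand are our own and are not taken from any of the treatments discussed); it is meant only to suggest how every explicitly defined term unwinds into primitive terms, and how a circular chain of definitions shows itself.

```python
# Hypothetical toy vocabulary; the terms and their definitions are illustrative only.
PRIMITIVES = {"point", "between"}

# Each explicitly defined term abbreviates a phrase built from other terms.
DEFINITIONS = {
    "segment": ["point", "between"],
    "triangle": ["point", "segment"],
}


def expand(term, definitions, primitives, _active=()):
    """Rewrite `term` until only primitive terms remain.

    Raises ValueError for an undefined term or a circular chain of definitions.
    """
    if term in primitives:
        return [term]
    if term in _active:
        chain = " -> ".join(_active + (term,))
        raise ValueError("circular definition: " + chain)
    if term not in definitions:
        raise ValueError("undefined term: " + term)
    result = []
    for part in definitions[term]:
        result.extend(expand(part, definitions, primitives, _active + (term,)))
    return result
```

Under these assumptions, expanding "triangle" leaves a phrase composed of primitive terms alone, whereas a vocabulary in which word a is defined by word b and word b by word a is rejected as circular. The primitive terms themselves receive no entry in DEFINITIONS, which mirrors the fact that they are defined only implicitly by the postulates.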

Whereas  Euclid  attempted  a  kind  of  explicit  definition  of  the  terms 
point, line, and plane, for example, Pasch accepted these as primitive, or
irreducible,  terms  in  his  development  of  Euclidean  geometry;  he  considered 
them  only  implicitly  defined  by  the  basic  propositions  that  he  assumed  as 
postulates  in  his  treatment.  These  assumed  basic  propositions  were  described 
by  Pasch  as  nuclear.  Although  the  origin  of  the  nuclear  propositions  might 
be  found  in  empirical  considerations,  Pasch  emphasized  that  they  are  to  be 
enunciated  without  regard  to  any  empirical  significance.  He  declared  that 
the  creation  of  a  truly  deductive  science  demands  that  all  logical  deductions 
must  be  independent  of  any  meanings  which  might  be  attached  to  the 
various concepts. In fact, if it becomes necessary at any point in the construction
of a proof to refer to certain interpretations of the basic terms,
then  that  is  sufficient  evidence  that  the  proof  is  logically  inadequate.  On  the 
other  hand,  by  keeping  all  of  the  work  purely  formal,  various  applications 
of  the  discourse  may  be  obtained  by  assigning  different  suitable  meanings  to 
the  primitive  terms  employed.  From  this  point  of  view,  Euclidean  geometry 
is  essentially  a  symbolic  system  whose  validity  and  possibility  for  further 
development  do  not  depend  upon  any  specific  meanings  given  to  the  basic 
terms  employed  in  the  postulates  of  the  geometry;  Euclidean  geometry  is 
reduced  to  a  pure  exercise  in  logical  syntax.  Where  Euclid  appears  to  have 
been  guided  by  visual  imagery,  and  thus  subjected  to  the  making  of  tacit 
assumptions,  Pasch  attempted  to  avoid  this  pitfall  by  deliberately  considering 
geometry  as  a  purely  hypothetico-deductive  system.  Pasch  profoundly 
influenced  postulational  thinking  in  geometry,  and  later  works  in  the  field 
attempted  to  maintain  the  standards  of  rigor  which  he  had  introduced. 

Following Pasch, the Italian mathematician Giuseppe Peano (1858-1932)
gave,  in  1889,  a  new  postulational  development  of  Euclidean  geometry. 
Like  Pasch,  Peano  based  his  treatment  upon  certain  primitive  terms,  among 
which  are  an  entity  called  a  “point”  and  a  relation  among  points  designated 
by  “betweenness.”  From  many  points  of  view  Peano’s  work  is  largely  a 
translation  of  Pasch’s  treatise  into  the  notation  of  a  symbolic  logic  which 
Peano introduced to the mathematical world. In Peano’s version no empiricism
is found; his geometry is purely formalistic by virtue of the fact that
it is constructed as a calculus of relations between variables. Here we have
the mathematician’s ultimate cloak of protection from the pitfall of overfamiliarity
with his subject matter. We have seen that Euclid, working
with visual diagrams in a field of study with which he was very familiar,
unconsciously made numerous hidden assumptions which were not guaranteed
to him by his axioms and postulates. To protect himself from similar
prejudice,  Peano  conceived  the  idea  of  symbolizing  his  primitive  terms  and 
his logical processes of thought. Clearly, if one is to say, “Two x’s determine
a y,” instead of, “Two points determine a straight line,” one is not so likely
to be biased by preconceived notions about “points” and “straight lines,”
and  if  a  symbolic  logic  is  employed  in  the  reasoning,  one  is  not  so  likely  to 
fall  into  fallacies  stemming  from  slippery  intuition  and  other  modes  of  loose 
reasoning.  The  derivation  of  theorems  becomes  an  algebraic  process  in  which 
only  symbols  and  formulas  are  employed,  and  geometry  is  reduced  to  a 
strictly  formal  process  which  is  entirely  independent  of  any  interpretations 
of  the  symbols  involved. 

We  shall  hear  more  of  Peano  in  subsequent  chapters,  for  his  interest  in 
formal  systems  extended  beyond  geometry.  Today  Peano  is  recognized  not 
only  as  one  of  the  truly  original  geniuses  in  mathematics,  but  also  as  one  of 
the  creators  of  modern  mathematical  logic.  He  continued  the  work  in  the 
field of symbolic logic that had been commenced by George Boole and his
successors, essentially devising a kind of universal shorthand for the mathematics
of his time. The important subject of symbolic, or mathematical, logic
will  be  considered  more  fully  in  a  later  chapter.  For  the  present,  we  wish  to 
point  out  that  Peano’s  logical  analysis  of  geometry  rendered  even  clearer  the 
concept  of  geometry  as  an  abstract  hypothetico-deductive  system  with  no 
intrinsic  content  beyond  that  implied  by  the  postulates. 

Another Italian mathematician, Mario Pieri (1860-1913), employed,
in  1899,  in  his  studies  of  Euclidean  geometry,  a  quite  different  approach  from 
that  of  his  predecessors.  He  considered  the  subject  of  his  study  to  be  an 
aggregate  of  undefined  elements  called  “points”  and  an  undefined  concept 
of  “motion.”  Pieri’s  first  five  postulates  will  indicate  the  important  role 
assigned  to  the  concept  of  motion.  They  are 

1.  There  is  given  an  aggregate  S  of  points  containing  at  least  two  distinct  members. 

2. A motion establishes a pairing of the points of S such that to each point P of S
there corresponds some point P' of S. For any motion which establishes a correspondence
between the points P of S and the points P' of S, there is an inverse
motion which establishes a correspondence between the points P' of S and the
points P of S.

3.  The  resultant  of  two  motions  performed  successively  is  equivalent  to  a  single 
motion. 

As  a  consequence  of  Postulates  2  and  3,  the  motion  equivalent  to  some 
motion  followed  by  its  inverse  is  a  motion  which  makes  each  point  of  S 
correspond  to  itself,  or,  in  other  words,  leaves  each  point  of  S  fixed.  This 
motion  is  called  the  identical  motion.  A  motion  which  is  not  the  identical 
motion  is  called  an  effective  motion. 

4.  For  any  two  distinct  points  A  and  B,  there  exists  an  effective  motion  which  leaves 
A  and  B  fixed.  Such  a  motion  may  be  referred  to  as  a  rotation  motion  about 
the  two  points  A  and  B. 

5.  If  there  is  an  effective  motion  which  leaves  three  points  A,  B,  C  fixed,  then  every 
motion  which  leaves  A  and  B  fixed  also  leaves  C  fixed. 

As  a  result  of  Postulate  5,  it  is  now  possible  to  define  the  straight  line 
determined by two points A and B: “The straight line AB is the aggregate
of  points  which  remain  fixed  under  any  effective  motion  which  leaves  A 
and  B  fixed.” 
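
As an illustration only, and assuming the familiar coordinate model of three-dimensional space (which is no part of Pieri’s system), we may let rotations about the z-axis play the role of motions that leave fixed the two points A = (0, 0, 0) and B = (0, 0, 1). The following Python sketch, whose function names are our own, shows one such effective motion fixing a point of the z-axis and moving a point off it, and shows a motion followed by its inverse acting as the identical motion. It tests a single sample rotation and so suggests, but does not prove, the definition of the straight line AB.

```python
import math


def rotation_about_z(theta):
    """Return a motion of 3-space: rotation through angle theta about the z-axis."""
    c, s = math.cos(theta), math.sin(theta)

    def motion(p):
        x, y, z = p
        return (c * x - s * y, s * x + c * y, z)

    return motion


def compose(f, g):
    """Resultant of motion f followed by motion g (compare Postulate 3)."""
    return lambda p: g(f(p))


def is_fixed(motion, p, tol=1e-12):
    """True if the motion leaves the point p where it is, up to rounding error."""
    return all(abs(a - b) <= tol for a, b in zip(motion(p), p))


A, B = (0.0, 0.0, 0.0), (0.0, 0.0, 1.0)
turn = rotation_about_z(math.pi / 3)    # an effective motion fixing A and B
back = rotation_about_z(-math.pi / 3)   # its inverse (compare Postulate 2)
identical = compose(turn, back)         # leaves every point fixed

on_line = (0.0, 0.0, 2.5)    # a point of the z-axis, which passes through A and B
off_line = (1.0, 0.0, 2.5)   # a point not on that axis
```

In this model is_fixed(turn, on_line) holds while is_fixed(turn, off_line) fails, in agreement with the description of the line AB as the aggregate of points left fixed by the effective motions that fix A and B.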

It  can  be  seen  from  the  brief  sample  of  Pieri’s  postulate  system  given 
above  that  the  concern  is  with  the  results  of  a  given  motion  rather  than  with 
the  motion’s  actual  nature.  The  modern  mathematician  would  speak  of 
transformations, or mappings, instead of motions. It should be noted that Pieri’s
idea of motion, which is essentially that of rigid displacement, is adaptable
to the Euclidean superposition proofs.

Although Pieri’s treatment of Euclidean geometry received no wide
acceptance,  the  development  of  certain  modern  notions  is  apparent  in  his 
work.  Pieri  was  considering  Euclidean  geometry  as  a  study  of  the  properties 
and  relations  of  configurations  of  points  that  remain  invariant  under  the 
set  of  all  rigid  displacements.  We  shall  say  more  about  this  idea  later  on, 
for  the  general  concept  of  building  a  geometry  as  the  invariant  theory  of  a 
group  of  transformations  pointed  the  way  to  important  studies  in  the  late 
nineteenth  and  early  twentieth  centuries. 

4.2 Hilbert’s Grundlagen der Geometrie

The eminent German mathematician, Professor David Hilbert (1862-1943),
gave a course of lectures on the foundations of Euclidean geometry
at the University of Göttingen during the 1898-1899 winter term. These
lectures,  which  concerned  themselves  with  a  postulational  discussion  of 
Euclidean  geometry,  were  rearranged  and  published  in  a  slender  volume 
in  June,  1899,  under  the  title  Grundlagen  der  Geometrie  (Foundations  of 
Geometry).  This  work,  in  its  various  improved  revisions,  is  today  a  classic 
in  its  field;  it  has  done  far  more  than  any  other  single  work  since  the  discovery 
of  non-Euclidean  geometry  to  promote  the  modern  axiomatic  method  and 
to  shape  the  character  of  a  good  deal  of  present-day  mathematics.  The 
influence  of  the  book  was  immediate.  A  French  edition  appeared  soon  after 
the  German  publication,  and  an  English  version,  translated  by  E.  J. 
Townsend,  appeared  in  1902.*  The  work  went  through  seven  German 
editions  during  the  author’s  lifetime,  the  seventh  edition  appearing  in  1930. 
An  eighth  German  edition,  a  revision  and  enlargement  by  Paul  Bernays, 
appeared  in  1956,  and  a  ninth  edition,  also  by  Bernays,  appeared  in  1962. 

By  developing  a  postulate  set  for  plane  and  solid  Euclidean  geometry 
that  does  not  depart  too  greatly  in  spirit  from  Euclid’s  own,  and  by  employing 
a  minimum  of  symbolism,  Hilbert  succeeded  in  convincing  mathematicians, 
to  a  far  greater  extent  than  had  Pasch  and  Peano,  of  the  purely  hypothetico- 
deductive  nature  of  geometry.  But  the  influence  of  Hilbert’s  work  went  far 
beyond  this,  for,  backed  by  the  author’s  great  mathematical  authority,  it 
firmly  implanted  the  postulational  method,  not  only  in  the  field  of  geometry, 
but  also  in  essentially  every  other  branch  of  mathematics  of  the  twentieth 
century.  The  stimulus  to  the  development  of  the  foundations  of  mathematics 
provided  by  Hilbert’s  little  book  is  difficult  to  overestimate.  Lacking  the 
strange  symbolism  of  the  works  of  Pasch  and  Peano,  Hilbert’s  work  can  be 
read,  in  great  part,  by  any  intelligent  student  of  high  school  geometry. 

Whereas Euclid made a distinction between “axioms” and “postulates,”
modern mathematicians consider these two terms synonymous, and designate
all the assumed propositions of a logical discourse by either term. From this
point of view, Hilbert’s treatment of plane and solid Euclidean geometry
rests upon 21 axioms or postulates, and these involve six primitive, or
undefined, terms. For simplicity we shall here consider only those postulates
of Hilbert’s set which apply to plane geometry. Under this limitation
there are only 15 postulates and five primitive terms.

*D. Hilbert [1] (Bibliography).

The primitive terms in Hilbert’s treatment of plane Euclidean geometry
are point, line (meaning straight line), on (a relation between a point and a line),
between (a relation between a point and a pair of points), and congruent
(a relation between pairs of points and between configurations called angles,
which are explicitly defined in the treatment). For convenience of language,
the phrase “point A is on line m” will frequently be stated alternatively by
the equivalent phrases, “line m passes through point A” or “line m contains
point A.”

We shall now list Hilbert’s 15 postulates for plane geometry, interspersing
these postulates with occasional definitions when needed. The
statements  of  the  postulates  are  taken,  with  some  slight  modifications  for 
the  sake  of  clarity,  from  the  seventh  (1930)  edition  of  Hilbert’s  Grundlagen 
der  Geometrie.  The  postulates  will  be  presented  in  certain  related  groups. 

Group  I:  Postulates  of  connection 

I-1. There is one and only one line passing through any two given distinct points.

I-2. Every line contains at least two distinct points, and for any given line there is at
least one point not on the line.

Group  II:  Postulates  of  order 

II-1. If point C is between points A and B, then A, B, C are all on the same line, and
C is between B and A, and B is not between C and A, and A is not between C and B.

II-2. For any two distinct points A and B there is always a point C which is between
A and B, and a point D which is such that B is between A and D.

II-3. If A, B, C are three distinct points on the same line, then one of the points is between
the other two.

Definition.  By  the  segment  AB  is  meant  the  points  A  and  B  and  all  points  which 
are  between  A  and  B.  Points  A  and  B  are  called  the  end  points  of  the  segment.  A 
point  C  is  said  to  be  on  the  segment  AB  if  it  is  A  or  B  or  some  point  between  A  and  B. 

Definition.  Two  lines,  a  line  and  a  segment,  or  two  segments,  are  said  to 
intersect  if  there  is  a  point  which  is  on  both  of  them. 

Definition. Let A, B, C be three points not on the same line. Then by the
triangle ABC is meant the three segments AB, BC, CA. The segments AB, BC, CA
are called the sides of the triangle, and the points A, B, C are called the vertices of
the triangle.

II-4. (Pasch’s postulate) A line which intersects one side of a triangle but does not
pass through any of the vertices of the triangle must also intersect another side of the triangle.

Group III: Postulates of congruence

III-1. If A and B are distinct points and if A' is a point on a line m, then there are
two and only two distinct points B' and B'' on m such that the pair of points A', B' is congruent
to the pair A, B and the pair of points A', B'' is congruent to the pair A, B; moreover, A'
is between B' and B''.

III-2. If two pairs of points are congruent to the same pair of points, then they are
congruent to each other.

III-3. If point C is between points A and B and point C' is between points A' and B',
and if the pair of points A, C is congruent to the pair A', C', and the pair of points C, B is
congruent to the pair C', B', then the pair of points A, B is congruent to the pair A', B'.

Definition.  Two  segments  are  said  to  be  congruent  if  the  end  points  of  the 
segments  are  congruent  pairs  of  points. 

Definition.  By  the  ray  AB  is  meant  the  set  of  all  points  consisting  of  those 
which  are  between  A  and  B,  the  point  B  itself,  and  all  points  C  such  that  B  is  between 
A  and  C.  The  ray  AB  is  said  to  emanate  from  point  A. 

Theorem. If B' is any point on the ray AB, then the rays AB' and AB are identical.

Definition.  By  an  angle  is  meant  a  point  (called  the  vertex  of  the  angle)  and  two 
rays  (called  the  sides  of  the  angle)  emanating  from  the  point.  By  virtue  of  the  above 
theorem,  if  the  vertex  of  the  angle  is  point  A  and  if  B  and  C  are  any  two  points 
other  than  A  on  the  two  sides  of  the  angle,  we  may  unambiguously  speak  of  the 
angle  BAC  (or  CAB). 

Definition. If ABC is a triangle, then the three angles BAC, CBA, ACB are
called  the  angles  of  the  triangle.  Angle  BAC  is  said  to  be  included  by  the  sides  AB 
and  AC  of  the  triangle. 

III-4. If BAC is an angle whose sides do not lie in the same line, and if A' and B'
are two distinct points, then there are two and only two distinct rays, A'C' and A'C'', such
that angle B'A'C' is congruent to angle BAC and angle B'A'C'' is congruent to angle BAC;
moreover, if D' is any point on the ray A'C' and D'' is any point on the ray A'C'', then the
segment D'D'' intersects the line determined by A' and B'.

III-5. Every angle is congruent to itself.

III-6. If two sides and the included angle of one triangle are congruent, respectively, to
two sides and the included angle of another triangle, then each of the remaining angles of the
first triangle is congruent to the corresponding angle of the second triangle.

Group  IV:  Postulate  of  parallels 

IV-1. (Playfair’s postulate) Through a given point A not on a given line m there
passes at most one line which does not intersect m.


Group V: Postulates of continuity

V-1. (Postulate of Archimedes) If A, B, C, D are four distinct points, then there
is, on the ray AB, a finite set of distinct points A_1, A_2, ..., A_n such that (1) each of the pairs
A, A_1; A_1, A_2; A_2, A_3; ...; A_{n-1}, A_n is congruent to the pair C, D, and (2) B is between
A and A_n.


V-2. (Postulate of completeness) The points of a line constitute a system of points
such that no new points can be assigned to the line without causing the line to violate at least
one of the nine postulates I-1, I-2, II-1, II-2, II-3, II-4, III-1, III-2, V-1.

Upon these fifteen postulates rests the entire extensive subject of plane
Euclidean geometry! To develop the geometry appreciably from these postulates
is too long a task for us to undertake here, but we shall add a few
words  concerning  the  significance  of  some  of  the  postulates. 

The  postulates  of  the  first  group  define  implicitly  the  idea  expressed 
by  the  primitive  term  “on,”  and  they  establish  a  connection  between  the 
two  primitive  entities,  “points”  and  “lines.” 
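
Since the postulates of connection say nothing about what points and lines are, they may be tested against any interpretation whatever. The Python sketch below is a hypothetical example of our own devising: points are three letters, lines are the two-letter sets, and “on” is read as set membership. It checks Postulates I-1 and I-2 mechanically in this finite interpretation.

```python
from itertools import combinations

# Hypothetical interpretation: "points" are letters, "lines" are sets of letters,
# and the primitive relation "on" is read as set membership.
POINTS = {"A", "B", "C"}
LINES = [frozenset(pair) for pair in combinations(sorted(POINTS), 2)]


def on(point, line):
    """The primitive relation 'on' under this interpretation."""
    return point in line


def satisfies_I1(points, lines):
    """I-1: exactly one line passes through any two distinct points."""
    return all(
        sum(1 for m in lines if on(p, m) and on(q, m)) == 1
        for p, q in combinations(sorted(points), 2)
    )


def satisfies_I2(points, lines):
    """I-2: every line has at least two points, and some point lies off each line."""
    return all(
        sum(1 for p in points if on(p, m)) >= 2
        and any(not on(p, m) for p in points)
        for m in lines
    )
```

Both checks succeed for the three-point interpretation. If instead the single set {A, B, C} is taken as the only line, I-1 still holds but I-2 fails, since no point lies off that line. The interpretation satisfies the postulates of Group I only; it is not offered as a model of the remaining groups.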

The  postulates  of  the  second  group  were  first  studied  by  Pasch,  and 
they  define  implicitly  the  idea  expressed  by  the  primitive  term  “between.” 
In  particular,  they  assure  us  of  the  existence  of  an  infinite  number  of  points 
on  a  line  and  that  a  line  is  not  terminated  at  any  point,  and  they  guarantee 
that  the  order  of  points  on  a  line  is  serial  rather  than  cyclical.  Postulate 
II-4 (Pasch’s postulate) differs from the other postulates of the group, for,
since  it  involves  points  not  all  on  the  same  line,  it  gives  information  about 
the  plane  as  a  whole.  The  postulates  of  order  are  of  historical  interest 
inasmuch  as  Euclid  completely  failed  to  recognize  any  of  them.  It  is  this 
serious omission on Euclid’s part that permits one, using only Euclid’s list
of  assumptions,  to  derive  paradoxes  by  applying  apparently  sound  reasoning 
to  misconceived  figures. 
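
The linear postulates of order can be tried out on a familiar interpretation. In the Python sketch below we assume, purely for illustration, that the points of a line are the rational numbers and that “between” means lying strictly between in the usual numerical order; the function names are our own. Postulates II-1 and II-3 are checked on a finite sample of points, and the two points demanded by II-2 are exhibited by explicit formulas.

```python
from fractions import Fraction
from itertools import permutations


def between(a, c, b):
    """Interpretation: c is between a and b on the rational number line."""
    return a < c < b or b < c < a


def check_II1(a, b, c):
    """II-1 (order part): if C is between A and B, then C is between B and A,
    B is not between C and A, and A is not between C and B."""
    if not between(a, c, b):
        return True  # the hypothesis fails, so there is nothing to check
    return between(b, c, a) and not between(c, b, a) and not between(c, a, b)


def witnesses_II2(a, b):
    """II-2: a point between A and B, and a point D such that B is between A and D."""
    return (a + b) / 2, 2 * b - a


def check_II3(a, b, c):
    """II-3: of three distinct points of the line, one is between the other two."""
    return between(a, b, c) or between(b, c, a) or between(c, a, b)


def holds_on_sample(points):
    """Check II-1 and II-3 for every ordered triple of distinct sample points."""
    return all(
        check_II1(a, b, c) and check_II3(a, b, c)
        for a, b, c in permutations(points, 3)
    )
```

A check on finitely many sample points is, of course, only evidence and not a proof that the interpretation satisfies the postulates. The midpoint formula in witnesses_II2 does indicate, however, why II-2 forces infinitely many points on a line: between any two points another can always be produced.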

The  postulates  of  the  third  group  define  implicitly  the  idea  expressed 
by  the  primitive  term  “congruence”  as  applied  to  pairs  of  points  and  to 
angles.  These  postulates  are  included  in  order  to  circumvent  the  necessity 
of  dealing  with  the  concept  of  motion.  For  example,  it  is  interesting  to  note 
how, in Postulate III-6, Hilbert introduces the congruency of triangles
without  employing  Euclid’s  method  of  superposition,  still  common  in  many 
high  school  textbooks. 

The  Playfair  parallel  postulate  appears  as  the  only  postulate  in  Group 
IV;  it  is,  of  course,  equivalent  to  Euclid’s  parallel  postulate.  Using  the 
postulates  of  the  first  three  groups  one  can  prove  that  there  is  at  least  one 
line  through  the  given  point  A  and  not  intersecting  the  given  line  m. 
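
The content of the Playfair postulate may be examined in a small finite interpretation. The Python sketch below is a hypothetical example of our own: four letters serve as points and the six two-letter sets serve as lines. This interpretation satisfies the postulates of connection but not the postulates of order, so it is not a model of Hilbert’s full system; it serves only to show the parallel condition being checked mechanically.

```python
from itertools import combinations


def all_pairs_as_lines(points):
    """Hypothetical interpretation in which every pair of points is a line."""
    return [frozenset(pair) for pair in combinations(sorted(points), 2)]


def parallels_through(point, line, lines):
    """Lines through `point` that have no point in common with `line`."""
    return [m for m in lines if point in m and not (m & line)]


def satisfies_playfair(points, lines):
    """At most one non-intersecting line passes through any point off a given line."""
    return all(
        len(parallels_through(p, m, lines)) <= 1
        for m in lines
        for p in points
        if p not in m
    )


FOUR_POINTS = {"A", "B", "C", "D"}
FOUR_POINT_LINES = all_pairs_as_lines(FOUR_POINTS)
```

With four points, each point off a given line lies on exactly one line that misses it, so the condition holds. With five points treated in the same way there are two such lines through each outside point, and the condition fails.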

The  first  postulate  of  the  last  group  (the  postulate  of  Archimedes) 
corresponds  to  the  familiar  process  of  estimating  the  distance  from  one  point 
of  a  line  to  another  by  the  use  of  a  measuring  stick;  it  guarantees  that  if  we 
start  at  the  one  point  and  lay  off  toward  the  second  point  a  succession  of 
equal  distances  (equal  to  the  length  of  the  measuring  stick)  we  will  ultimately 
pass  the  second  point.  Upon  this  postulate  can  be  made  to  depend  the  entire 
theory  of  measurement  and,  in  particular,  Euclid’s  theory  of  proportion. 
The  final  postulate  (the  postulate  of  completeness)  is  not  required  for  the 
derivation  of  the  theorems  of  Euclidean  geometry,  but  it  makes  possible  the 
establishment  of  a  one-to-one  correspondence  between  the  points  on  any 
line  and  the  set  of  all  real  numbers,  and  is  necessary  for  the  free  use  of  the 
real  number  system  in  analytic,  or  coordinate,  geometry.  It  can  be  shown 
that, in the presence of the other thirteen postulates, these last two postulates
are equivalent to the postulate of Dedekind, and therefore, if we wish, they
can  be  replaced  by  this  postulate.  In  other  words,  the  postulates  of  Group  V 
can  be  replaced  by  the  following  Alternative  Group  V. 

Alternate  Group  V 

Definitions.  Consider  a  segment  AB.  Let  us  call  one  end  point,  say  A,  the 
origin  of  the  segment,  and  the  other  end  point,  B,  the  extremity  of  the  segment.  Given 
two distinct points M and N of AB, we say that M precedes N (or N follows M) if M
coincides with the origin A or lies between A and N. A segment AB, considered in
this  way,  is  called  an  ordered  segment. 

V'-1. (Dedekind’s postulate) If the points of an ordered segment of origin A and
extremity  B  are  separated  into  two  classes  in  such  a  way  that 

1. each point of AB belongs to one and only one of the classes,

2. the points A and B belong to different classes (which we shall respectively call the first
class and the second class),

3.  each  point  of  the  first  class  precedes  each  point  of  the  second  class, 

then  there  exists  a  point  C  on  AB  such  that  every  point  of  AB  which  precedes  C  belongs  to  the 
first  class  and  every  point  of  AB  which  follows  C  belongs  to  the  second  class. 
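
The measuring-stick reading of the postulate of Archimedes (V-1), one of the two postulates that Dedekind’s postulate can replace, lends itself to a short computation. In the Python sketch below we assume, for illustration only, that points of a line are given by numerical coordinates, so that congruence of point pairs means equality of distances; the function name archimedes_steps is our own. The function returns the least number n of copies of the pair C, D that must be laid off from A toward B in order that B lie between A and A_n.

```python
from fractions import Fraction


def archimedes_steps(a, b, c, d):
    """Smallest n such that n copies of |CD| laid off from A toward B pass B,
    that is, such that B lies strictly between A and A_n."""
    span = abs(b - a)    # distance from A to B
    stick = abs(d - c)   # length of the measuring stick CD
    if span == 0 or stick == 0:
        raise ValueError("A, B and C, D must be pairs of distinct points")
    n = 1
    while n * stick <= span:
        n += 1
    return n


# A stick of length 1/3 laid off along a segment of length 1.
THIRDS = archimedes_steps(Fraction(0), Fraction(1), Fraction(0), Fraction(1, 3))
```

The loop terminates for any numerical coordinates precisely because the ordinary number system is Archimedean. In a non-Archimedean system, such as the one Hilbert constructs to show the independence of this postulate, there are pairs of segments for which no finite number of steps suffices.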

Considerably  more  was  accomplished  by  Hilbert  in  his  Grundlagen  der 
Geometrie  than  just  the  establishment  of  a  satisfactory  set  of  postulates  for 
Euclidean geometry. In showing the logical consistency and partial independence
of his postulates, Hilbert had to devise many interesting models,
or  interpretations,  for  different  subsets  of  the  postulates.  This  amounted  to 
introducing  various  new  systems  of  geometry  and  to  creating  a  number  of 
unusual  algebras  of  segments.  The  significance  of  several  important  postulates 
and  theorems  in  the  development  of  Euclidean  geometry  is  clearly  shown 
in  the  work,  and  examples  of  various  kinds  of  nontraditional  geometries  are 
illustrated.  For  example,  to  show  the  independence  of  the  postulate  of 
Archimedes  from  the  other  postulates  of  the  treatment,  an  example  of  a 
non-Archimedean  system  is  offered  in  which  all  the  postulates  except  the 
postulate  of  Archimedes  are  shown  to  hold.  There  is  also  developed  in  this 
work  a  theory  of  proportion  and  a  theory  of  areas  which  are  independent 
of  the  postulate  of  Archimedes.  These  accompanying  investigations  by 
Hilbert virtually inaugurated the twentieth-century study of abstract geometry
and successfully convinced many mathematicians of the hypothetico-deductive
nature of mathematics. By implanting the postulational method
in  nearly  all  of  mathematics  since  1900,  Hilbert’s  Grundlagen  der  Geometrie 
represents  a  definite  landmark  in  the  history  of  mathematical  thought. 

Other  postulational  treatments  of  Euclidean  geometry  followed  Hilbert’s 
effort.  In  1904,  the  American  mathematician  Oswald  Veblen  (1880-1960) 
furnished  a  new  postulate  set  in  which  he  replaced  the  primitive  notion  of 
betweenness, as used by Peano and Hilbert, by a more pervasive primitive
relation of “order.”* With this new primitive relation, the terms “line,”
“plane,”  “on,”  and  “congruent”  can  receive  explicit  definition,  and  thus 
the  list  of  primitive  terms  be  reduced  to  just  two,  namely,  “point”  and 
“order.”  There  is  a  feeling  among  mathematicians  that  the  smaller  one  can 
make  the  number  of  primitive  terms  in  a  postulational  development,  the 
more  aesthetically  pleasing  is  that  postulational  development— a  principle 
that  was  emphasized  by  Peano.  A  second  study  of  the  foundations  of 
Euclidean  geometry  was  made  by  Veblen  in  1911,  in  which  his  original 
treatment was slightly revised to accord with some ideas put forth by R. L.
Moore.†

A very satisfying combination of the postulates of Hilbert and Veblen
has been employed by Gilbert de B. Robinson.‡ Robinson’s postulates are
essentially Veblen’s postulates of order combined with Hilbert’s postulates
of congruence and continuity. In some ways Robinson’s postulate set offers
improvements over the two previous sets, and many students may prefer
the study of this system to that of either Hilbert or Veblen.

After  the  construction  of  Veblen’s  system  of  postulates  for  Euclidean 
geometry,  the  study  became  increasingly  abstract  and  formal.  An  interesting 
example of this tendency may be found in a paper written by E. V. Huntington
(1874-1952) in 1913.§ In this paper, Huntington offers a treatment of three-dimensional
Euclidean geometry based upon “sphere” and “inclusion” (one
sphere  lying  within  another)  as  primitive  terms.  This  unusual  approach 
exemplifies  the  fact  that  it  is  possible  to  characterize  Euclidean  geometry  by 
systems  of  postulates  which  are  superficially  very  different  from  one  another. 
Huntington  did  a  great  deal  of  work  with  postulate  systems,  and  we  shall 
meet  his  name  again  when  we  consider  postulational  developments  in  algebra 
and  arithmetic. 

An  excellent  and  detailed  abstract  postulational  examination  of 
Euclidean  geometry  appeared  in  1927  in  a  work  by  Henry  George  Forder.|| 
Here  we  find  many  alternative  postulate  sets  compared  with  one  another. 
For  example,  Forder  considers  nine  different  parallel  postulates,  which  vary 
in  the  strengths  of  their  assumptions.  By  adopting  a  strong  parallel  postulate 
and  by  using  Dedekind’s  postulate  as  a  postulate  of  continuity,  Forder  gives 
a  postulate  set  for  Euclidean  geometry  based  on  only  the  two  primitive 
terms  “point”  and  “order.”  He  also  gives  an  abstract  treatment  of  a  postulate 
set of Pieri’s, based on the two primitive terms “point” and “congruence.”

*O. Veblen [1].

†O. Veblen [2].

‡G. de B. Robinson.

§E.  V.  Huntington  [3];  also  see  G.  de  B.  Robinson,  Appendix,  pp.  157-160. 

||H. G. Forder [1].

A singularly elegant and concise postulate set for plane Euclidean
geometry,  based  upon  the  two  primitive  terms  “point”  and  “distance,”  was 
published  by  L.  M.  Blumenthal  in  1961.*  The  postulate  set  characterizes 
the  Euclidean  plane  as  a  special  kind  of  metric  space  (a  concept  which  will 
be  considered  in  Chapter  8),  and  is  easily  modified  to  give  a  postulate  set 
for n-dimensional Euclidean geometry and for plane Lobachevskian geometry.

Whether  or  not  it  is  wise  to  attempt  a  postulational  treatment  of 
Euclidean  geometry  at  the  high  school  level  is  a  matter  of  pedagogical 
opinion.  George  Bruce  Halsted  made  an  unsuccessful  effort  in  1904,  when 
he published an elementary geometry textbook based upon Hilbert’s postulate
set.† More successful, and certainly worthy of examination, is an attempt
made in 1940 by Professors George David Birkhoff and Ralph Beatley of
Harvard University.‡ Here a teachable high school course in plane Euclidean
geometry  is  evolved  from  five  postulates  based  upon  an  ability  to  measure 
line  segments  and  angles.  Although  for  pedagogical  reasons  certain  subtler 
mathematical  and  logical  details  are  either  ignored  or  slurred  over,  the  work 
does  stem  from  a  rigorous  mathematical  presentation  that  had  been  made 
earlier  by  Birkhoff.§  Since  about  the  middle  of  the  twentieth  century  a 
number  of  authors  and  writing  groups  have  attempted  the  task  of  producing 
textual  materials  for  the  high  school  geometry  class  wherein  geometry  is 
developed  rigorously  from  a  postulational  basis.  In  these  attempts  usually 
either the Hilbert postulate set or the Birkhoff postulate set (sometimes
slightly  altered  or  augmented)  is  adopted. 

4.3 Poincaré's Model and the Consistency of Lobachevskian Geometry

A  satisfactory  postulate  set  for  plane  Lobachevskian  non-Euclidean 
geometry  may  be  obtained  from  the  Hilbert  postulate  set  given  in  the  last 
section  by  simply  replacing  the  postulate  of  parallels  (Postulate  IV-1)  by 

IV'-1. Through a given point A not on a given line m there pass at least two
lines  which  do  not  intersect  line  m. 

Developments  of  plane  Lobachevskian  geometry  have  been  made  using  such 
a postulate set as a foundation.|| It is our purpose, in the present section,
to  show  that  plane  Lobachevskian  geometry  is  as  consistent  as  plane 
Euclidean  geometry.  To  accomplish  this  we  shall  employ  a  model,  or 

*L.  M.  Blumenthal  [2]. 

†G. B. Halsted [1]. Certain logical criticisms of the text were met by Halsted in 1907
in a thoroughly revised second edition; this edition has been translated into French.

‡G. D. Birkhoff and R. Beatley.

§G.  D.  Birkhoff  [1]. 

||See,  for  example,  G.  Verriest. 
representation, of plane Lobachevskian geometry that was devised by Henri
Poincaré. Our aim will be accomplished when we show that this model is
a  system  of  geometrical  elements  and  relations  which,  when  substituted  for 
the  primitive  terms  in  the  postulates  for  plane  Lobachevskian  geometry, 
convert  these  postulates  into  theorems  in  Euclidean  geometry. 

The Poincaré model that we shall employ may be described as follows.
Let a fixed circle Σ (see Figure 23) be selected and called the fundamental
circle. A point of the Lobachevskian plane is represented in the model by
a point in the interior of Σ, and will hereafter be designated, by means of
boldface type, as a point. A line of the Lobachevskian plane is represented
in the model by the arc interior to Σ of any circle, or straight line, which
cuts Σ perpendicularly, and will hereafter be designated as a line. Boldface
type will be similarly employed whenever we wish to understand that the
concept considered is a Lobachevskian, rather than a Euclidean, concept.
The relationships of a point on a line, and of a point between two points,
have, in the model, the obvious interpretations. To interpret suitably
congruence of segments and congruence of angles two definitions are
made. The (positive) length of a segment AB is defined as

$$AB = \log\,(AB, TS),$$

where S and T are the points in which the circle containing the segment
AB meets Σ, A lying between S and B, and (AB, TS) denotes the so-called
cross ratio (AT/BT)(BS/AS). And the measure of the angle between two
intersecting lines is defined as the ordinary Euclidean measure of the angle
between the two circles or straight lines upon which the lines lie. Two
segments are congruent if and only if they have equal lengths, and two
angles are congruent if and only if they have equal measures.
We shall now show that, with the above interpretations, the Hilbert
postulates for plane Lobachevskian geometry become theorems in plane
Euclidean geometry.

We verify Postulate I-1 by proving that there is one and only one circle
(or possibly straight line) orthogonal to a given circle Σ and passing through
two given points A and B lying inside Σ.

FIG. 24

Let O (see Figure 24) be the center of Σ, and let A' be the point on OA
produced such that $OA \cdot OA' = \rho^2$, where ρ is the radius of Σ. We shall now
show that the circle Π determined by the three points A, B, A' is orthogonal
(that is, perpendicular) to Σ. (If these three points are collinear, circle Π
will degenerate into a diametral line of Σ, which is certainly orthogonal to
Σ.) Let C be the center and r the radius of Π, and let OC cut Π in Q and Q'
as indicated in the figure. Then

$$\rho^2 = OA \cdot OA' = OQ \cdot OQ' = (OC - r)(OC + r) = OC^2 - r^2,$$

or

$$r^2 + \rho^2 = OC^2,$$

and circles Σ and Π cut each other orthogonally. Thus there is at least one
circle through points A and B and cutting circle Σ orthogonally. To show
that there is only one such circle, let Π now represent any circle through
A and B and orthogonal to Σ, and let OA produced cut Π in point A'.
Then, since

$$r^2 + \rho^2 = OC^2,$$

we have

$$\rho^2 = OC^2 - r^2 = (OC - r)(OC + r) = OQ \cdot OQ' = OA \cdot OA',$$

and A' is the point on OA produced such that $OA \cdot OA' = \rho^2$. Therefore
the circle Π coincides with the circle considered earlier, and there is only
one circle through A and B orthogonal to Σ.
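
The construction used for Postulate I-1 can be checked numerically. The following is only a sketch under stated assumptions: the fundamental circle is taken to be centered at the origin, the sample points and the helper names (`inverse_point`, `circumcircle`, `orthogonal_circle`) are hypothetical, and floating-point arithmetic stands in for exact Euclidean reasoning. It builds the circle through A, B, and the inverse point A', then tests the orthogonality condition that the squared radii sum to the squared distance between centers.

```python
import math

def inverse_point(a, rho=1.0):
    """Return A' on ray OA with OA * OA' = rho**2 (O is the origin, A != O)."""
    x, y = a
    k = rho * rho / (x * x + y * y)
    return (k * x, k * y)

def circumcircle(p, q, s):
    """Center and radius of the Euclidean circle through three points."""
    (px, py), (qx, qy), (sx, sy) = p, q, s
    d = 2.0 * (px * (qy - sy) + qx * (sy - py) + sx * (py - qy))
    if abs(d) < 1e-12:
        raise ValueError("collinear points: the circle degenerates to a diameter")
    p2, q2, s2 = px * px + py * py, qx * qx + qy * qy, sx * sx + sy * sy
    cx = (p2 * (qy - sy) + q2 * (sy - py) + s2 * (py - qy)) / d
    cy = (p2 * (sx - qx) + q2 * (px - sx) + s2 * (qx - px)) / d
    return (cx, cy), math.hypot(px - cx, py - cy)

def orthogonal_circle(a, b, rho=1.0):
    """Circle through A, B, and the inverse point A' of A."""
    return circumcircle(a, b, inverse_point(a, rho))

# Illustrative points inside the unit circle (assumed values).
A, B, RHO = (0.3, 0.1), (-0.2, 0.5), 1.0
center, r = orthogonal_circle(A, B, RHO)
oc_squared = center[0] ** 2 + center[1] ** 2

# Orthogonality test: r^2 + rho^2 should equal OC^2 up to rounding.
print(abs(r * r + RHO * RHO - oc_squared) < 1e-9)
```

The collinear case, in which the line of the model is a diameter, is reported as an error rather than handled, since no finite circle exists there.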

The reader can easily verify that Postulates I-2, II-1, II-2, II-3, II-4,
III-1, and III-2 are obviously satisfied by our interpretations of point, line,
on,  between,  and  congruence  of  segments. 

The verification of Postulate III-3 follows from the fact that

$$\begin{aligned}
AB &= \log\,(AB, TS) \\
   &= \log\,[(AC, TS)(CB, TS)] \\
   &= \log\,(AC, TS) + \log\,(CB, TS) \\
   &= AC + CB.
\end{aligned}$$

Similarly, A'B' = A'C' + C'B', whence, since A'C' = AC and C'B' = CB,
we have A'B' = AB, and the segments A'B' and AB are congruent.
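
The additivity of the cross-ratio length can be illustrated numerically. This is a minimal sketch, not the text's own procedure: it assumes a fundamental circle of radius 1 centered at the origin, finds the center of the orthogonal circle from the equivalent linear condition 2 C·P = |P|² + 1 (a restatement of orthogonality, used here for brevity), and takes the absolute value of the logarithm instead of tracking which endpoint is S and which is T. All function names and sample points are hypothetical.

```python
import math

def orthogonal_circle(a, b):
    """Center and radius of the circle through a, b orthogonal to the unit circle.

    Orthogonality means |C|^2 = r^2 + 1, so |C - P|^2 = r^2 reduces to the
    linear condition 2 C.P = |P|^2 + 1 for every point P on the circle.
    """
    (ax, ay), (bx, by) = a, b
    det = ax * by - ay * bx
    if abs(det) < 1e-12:
        raise ValueError("a, b, O collinear: the line is a diameter")
    ka = (ax * ax + ay * ay + 1.0) / 2.0
    kb = (bx * bx + by * by + 1.0) / 2.0
    cx = (ka * by - kb * ay) / det
    cy = (ax * kb - bx * ka) / det
    return (cx, cy), math.sqrt(cx * cx + cy * cy - 1.0)

def ideal_points(a, b):
    """The two points where the orthogonal circle through a, b meets the unit circle."""
    (cx, cy), r = orthogonal_circle(a, b)
    n2 = cx * cx + cy * cy
    return ((cx - r * cy) / n2, (cy + r * cx) / n2), ((cx + r * cy) / n2, (cy - r * cx) / n2)

def dist(p, q):
    return math.hypot(p[0] - q[0], p[1] - q[1])

def length(a, b):
    """Length |log (AB, TS)| with cross ratio (AT/BT)(BS/AS)."""
    s, t = ideal_points(a, b)
    cross_ratio = (dist(a, t) / dist(b, t)) * (dist(b, s) / dist(a, s))
    return abs(math.log(cross_ratio))

def point_between(a, b):
    """Point on the same orthogonal circle, on the arc from a to b."""
    (cx, cy), r = orthogonal_circle(a, b)
    ux, uy = (a[0] - cx) + (b[0] - cx), (a[1] - cy) + (b[1] - cy)
    k = r / math.hypot(ux, uy)
    return (cx + k * ux, cy + k * uy)

A, B = (0.3, 0.1), (-0.2, 0.5)      # assumed sample points
C = point_between(A, B)
total, parts = length(A, B), length(A, C) + length(C, B)
print(abs(total - parts) < 1e-9)    # AB = AC + CB up to rounding
```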


Postulate III-4 becomes obvious once we show that there is a unique
circle Π orthogonal to Σ, passing through a given point A' within Σ, and
tangent at A' to a ray A'R emanating from A'. This demonstration is easily
accomplished, for, by virtue of our work in connection with Postulate I-1,
circle Π (see Figure 25) must pass through point A'' on OA' produced and
such that $OA' \cdot OA'' = \rho^2$. But there is a unique circle passing through
A' and A'' and tangent to A'R at A'; its center lies at the intersection of the
perpendicular to A'R at A' and the perpendicular bisector of segment A'A''.

Postulates III-5 and V-1 are easily seen to be satisfied by our interpretations,
and Postulate V-2 is not needed for a noncoordinate development
of the subject. The only postulates still remaining to be verified are, then,
Postulate III-6 and Postulate IV'-1. The task of verifying Postulate III-6
is  not  so  easy,  as  it  requires  some  knowledge  of  elementary  college  geometry 
and  in  particular  of  the  transformation  of  inversion.  We  shall  accordingly 
relegate  this  piece  of  verification  to  the  problems  at  the  end  of  the  chapter, 
where  the  required  theory  from  college  geometry  will  be  developed. 

We shall conclude, then, with a verification of the Lobachevskian
parallel postulate, Postulate IV'-1. In Figure 26, let A be a given point
and let m be a given line not passing through A. Let S and T be the points
where the circle m cuts Σ. Now through S and A there passes a unique circle
cutting Σ orthogonally, and through T and A there passes a unique circle
cutting Σ orthogonally. These circles are tangent, at S and T, respectively,
to circle m, and are distinct from one another. Clearly, any circle orthogonal
to Σ, passing through A, and lying within the angle d indicated in the figure,
fails to cut line m.
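
The tangency claim in this verification can be spot-checked for one configuration. The sketch below is illustrative only: it assumes a unit fundamental circle centered at the origin, picks S, T, and A arbitrarily, and reuses the linear form of the orthogonality condition (2 C·P = |P|² + 1) to find each center. The names `orthogonal_circle` and `tangent` are hypothetical helpers.

```python
import math

def orthogonal_circle(p, q):
    """Circle through p and q orthogonal to the unit circle centered at O.

    p and q may lie inside or on the unit circle; the center C solves
    2 C.P = |P|^2 + 1 for P = p and P = q.
    """
    (px, py), (qx, qy) = p, q
    det = px * qy - py * qx
    if abs(det) < 1e-12:
        raise ValueError("p, q, O collinear: the line is a diameter")
    kp = (px * px + py * py + 1.0) / 2.0
    kq = (qx * qx + qy * qy + 1.0) / 2.0
    cx = (kp * qy - kq * py) / det
    cy = (px * kq - qx * kp) / det
    return (cx, cy), math.sqrt(cx * cx + cy * cy - 1.0)

def tangent(circle1, circle2, tol=1e-9):
    """True if two circles touch (externally or internally)."""
    (c1, r1), (c2, r2) = circle1, circle2
    d = math.hypot(c1[0] - c2[0], c1[1] - c2[1])
    return abs(d - (r1 + r2)) < tol or abs(d - abs(r1 - r2)) < tol

S, T = (1.0, 0.0), (0.0, 1.0)   # assumed end points of line m on the unit circle
A = (-0.2, -0.1)                # assumed point not on m
m = orthogonal_circle(S, T)
limit_s = orthogonal_circle(S, A)   # circle through S and A
limit_t = orthogonal_circle(T, A)   # circle through T and A

print(tangent(m, limit_s), tangent(m, limit_t), limit_s != limit_t)
```

Because each limiting circle touches circle m only at a point of the fundamental circle itself, the two lines through A have no point in common with m inside the model, which gives at least two lines through A that do not intersect m.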

We  may  now  consider  that  the  purpose  of  this  section  (to  show  that 
plane  Lobachevskian  geometry  is  as  consistent  as  plane  Euclidean  geometry) 
has  been  accomplished.  For,  should  there  be  any  inconsistency  in  plane 
Lobachevskian geometry, there would have to be a corresponding inconsistency
in the plane Euclidean geometry of the Poincaré model.
By replacing the fundamental circle Σ by a fundamental sphere, and by
considering spherical surfaces (and planes) cutting this fundamental sphere
orthogonally, the Poincaré model can be extended to show that solid
Lobachevskian  geometry  is  consistent  if  solid  Euclidean  geometry  is  consistent. 

4.4  Analytic  Geometry 

Every  student  of  mathematics  meets,  early  in  his  college  work,  the 
remarkable subject called analytic geometry, and he can hardly fail to be
impressed  by  the  powerful  idea  behind  it.  The  essence  of  the  idea  as  applied 
to  the  plane,  it  will  be  recalled,  is  the  establishment  of  a  correspondence 
between  pairs  of  real  numbers  and  points  in  the  plane,  thereby  making 
possible  a  correspondence  between  curves  in  the  plane  and  equations  in 
two  variables,  so  that  for  each  curve  in  the  plane  there  is  a  definite  equation 
f(x,  y)  =  0,  and  for  each  such  equation  there  is  a  definite  curve,  or  set  of 
points,  in  the  plane.  A  correspondence  is  similarly  established  between  the 
algebraic and analytic properties of the equation f(x, y) = 0 and the geometric
properties of the associated curve. The task of proving a theorem in
geometry  is  cleverly  shifted  to  that  of  proving  a  corresponding  theorem  in 
algebra  and  analysis. 

There  is  no  unanimity  of  opinion  among  historians  of  mathematics 
concerning  who  invented  analytic  geometry,  nor  even  concerning  what  age 
should  be  credited  with  the  invention.  Much  of  this  difference  of  opinion 
is  caused  by  a  lack  of  agreement  regarding  just  what  constitutes  analytic 
geometry.  There  are  those  who,  favoring  antiquity  as  the  era  of  the  invention, 
point  out  the  well-known  fact  that  the  concept  of  fixing  the  position  of  a 
point  by  means  of  suitable  coordinates  was  employed  in  the  ancient  world 
by  the  Egyptians  and  the  Romans  in  surveying,  and  by  the  Greeks  in  map 
making.  And,  if  analytic  geometry  implies  not  only  the  use  of  coordinates 
but  also  the  geometric  interpretation  of  relations  among  coordinates,  then 
particularly  strong  in  the  favor  of  the  Greeks  is  the  fact  that  Apollonius 
derived  the  bulk  of  his  geometry  of  the  conic  sections  from  the  geometrical 
equivalents  of  certain  Cartesian  equations  of  these  curves,  an  idea  which 
seems to have originated with Menaechmus about 350 B.C. Others claim
that  the  invention  of  analytic  geometry  should  be  credited  to  Nicole  Oresme, 
who  was  born  in  Normandy  about  1323,  and  who  died  in  1382  after  a  career 
that  carried  him  from  a  mathematics  professorship  to  a  bishopric.  Oresme, 
in  one  of  his  mathematical  tracts,  anticipated  another  aspect  of  analytic 
geometry  when  he  represented  certain  laws  by  graphing  the  dependent 
variable  against  the  independent  one,  as  the  latter  variable  was  permitted 
to  take  on  small  increments.  Advocates  for  Oresme  as  the  inventor  of  analytic 
geometry see in his work such accomplishments as the first explicit introduction
of the equation of a straight line and the extension of some of the
notions of the subject from two-dimensional space to three, and even four,
dimensional spaces.* A century after Oresme’s tract was written, it enjoyed
several  printings,  and  in  this  way  it  may  possibly  have  influenced  later 
mathematicians.  However,  before  analytic  geometry  could  assume  its  present 
highly practical form, it had to await the development of algebraic symbolism,
and accordingly it may be more nearly correct to agree with the majority
of historians, who regard the decisive contributions made in the seventeenth
century by the two French mathematicians, René Descartes (1596-1650)
and Pierre de Fermat (1601-1665), as the essential origin of at least the
modern  spirit  of  the  subject.  Not  until  after  the  great  impetus  given  to  the 
subject  by  these  two  men  do  we  find  analytic  geometry  in  a  form  with 
which  we  are  familiar. 

In  a  history  of  mathematics  a  good  deal  of  space  would  be  devoted  to 
Descartes  and  Fermat,  for  these  men  left  very  deep  imprints  on  the  subject. 
Also,  in  a  history  of  mathematics,  much  would  be  said  about  the  importance 
of  analytic  geometry,  not  only  for  the  development  of  geometry  and  for  the 
theory  of  curves  and  surfaces  in  particular,  but  as  an  indispensable  force  in 
the  development  of  the  calculus  and  as  an  influential  power  in  molding  our 
ideas  of  such  far-reaching  concepts  as  those  of  “function”  and  “dimension.” 
In  the  present  section,  however,  we  shall  restrict  ourselves  to  the  relations 
of  analytic  geometry  with  the  axiomatic,  or  postulational,  method;  these 
relations  will  prove  to  be  both  interesting  and  enlightening,  and  they  may 
clear  away  some  of  the  confusion  regarding  just  what  constitutes  analytic 
geometry. 

In  the  usual  college  textbooks  of  plane  analytic  geometry,  use  is  made 
of  various  definitions  and  theorems  of  Euclidean  geometry  in  order  to  set 
up  a  coordinate  system  in  which  a  point  is  represented  by  an  ordered  pair 
of  real  numbers.  Similar  use  of  some  definitions  and  theorems  of  Euclidean 
geometry  is  made  in  proving  that  a  straight  line  is  represented  by  a  linear 
equation  in  two  variables,  in  deriving  a  formula  for  the  distance  between 
two  points,  and  so  on.  In  such  a  development,  where  the  geometry  itself 
is  employed  in  setting  up  a  coordinate  machinery  for  the  study  of  the 
geometry,  a  student  might  well  wonder  whether  the  methods  of  plane  analytic 
geometry  are  adequate  for  the  establishment  of  any  theorem  which  is 
implied  by  the  postulates  of  plane  Euclidean  geometry.  The  answer  to  this 
question  will  be  in  the  affirmative,  and  the  blemish  of  assuming  tidbits  of 
plane  Euclidean  geometry  in  order  to  set  up  the  coordinate  machinery  will 
be  removed,  if  we  can  obtain,  for  example,  an  algebraic  interpretation  of 
the  Hilbert  postulate  set  for  plane  geometry  given  in  Section  4.2.  We  shall 
now  do  this. 

Our  task  is  to  assign  algebraic  meanings  to  the  five  primitive  terms, 
point, line, on, between, and congruent (as applied to pairs of points and to

*See  P.  Duhem. 
angles), which will convert the postulates of the Hilbert set into true algebraic
statements.  It  should  be  clear  that  such  a  step  merely  provides  what  may  be 
regarded  as  an  alternative  interpretation  of  Euclidean  geometry;  certainly 
the  result  is  just  as  “Euclidean”  as  the  specific  interpretation  studied  in  high 
school.  To  accomplish  our  present  purpose,  therefore,  we  make  the  following 
assignments: 

1. By a point we shall mean any ordered pair of real numbers; these numbers
will be called the coordinates of the point.

2. By a line we shall mean any equation in the two variables x and y of
the form $ax + by + c = 0$, where a, b, c are real numbers and a and b
are not both 0. Two such linear equations in x and y, whose left members
differ at most by only a constant nonzero factor, will represent the same
line, and either equation will be called an equation of the line.

3. We shall say a point is on a line if and only if the coordinates of the
point satisfy an equation of the line.

4. We shall say the point $(x, y)$ is between the points $(x_1, y_1)$ and $(x_2, y_2)$
if and only if there is a real number t, greater than 0 and less than 1,
such that

$$x = (1 - t)x_1 + t x_2 \quad\text{and}\quad y = (1 - t)y_1 + t y_2.$$

5. We shall say the pair of points $(x_1, y_1)$, $(x_2, y_2)$ is congruent to the pair
of points $(x_3, y_3)$, $(x_4, y_4)$ if and only if

$$(x_2 - x_1)^2 + (y_2 - y_1)^2 = (x_4 - x_3)^2 + (y_4 - y_3)^2.$$

We shall call the left member of this equation the square of the distance
between the points $(x_1, y_1)$ and $(x_2, y_2)$.

6. Finally, we shall say the angle, denoted by $(x_2, y_2)(x_1, y_1)(x_3, y_3)$, is
congruent to the angle, denoted by $(x_2', y_2')(x_1', y_1')(x_3', y_3')$, if and only if

$$\frac{(x_2 - x_1)(x_3 - x_1) + (y_2 - y_1)(y_3 - y_1)}{\sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2}\,\sqrt{(x_3 - x_1)^2 + (y_3 - y_1)^2}}
= \frac{(x_2' - x_1')(x_3' - x_1') + (y_2' - y_1')(y_3' - y_1')}{\sqrt{(x_2' - x_1')^2 + (y_2' - y_1')^2}\,\sqrt{(x_3' - x_1')^2 + (y_3' - y_1')^2}}.$$

We shall call the left-hand member of this equation the cosine of the
angle $(x_2, y_2)(x_1, y_1)(x_3, y_3)$.

It should be kept in mind that the names point, line, distance, and cosine
are applied to certain algebraic entities, and the words on, between, and
congruent  express  certain  relations  among  our  algebraic  entities.  With  the 
acceptance  of  these  algebraic  interpretations  of  the  primitive  terms  of 
Hilbert’s  postulate  set  we  may  now  convert  each  postulate  of  the  set  into 
an  algebraic  statement.  It  can  be  shown,  by  the  methods  of  algebra  alone, 
that  these  are  true  algebraic  statements.  Let  us  illustrate  how  this  is  done. 
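
The algebraic assignments for on and congruent can be written out directly. The following is a sketch under assumptions: coordinates are restricted to rational numbers so that Python's `Fraction` arithmetic is exact, a line is stored as a hypothetical coefficient triple `(a, b, c)`, and the two cosines are compared through their signs and squares so that no square root is ever computed. The function names are illustrative, not part of the text.

```python
from fractions import Fraction

def on(point, line):
    """Assignment 3: the coordinates satisfy a*x + b*y + c = 0."""
    x, y = point
    a, b, c = line
    return a * x + b * y + c == 0

def dist2(p, q):
    """Square of the distance between two points (assignment 5)."""
    return (q[0] - p[0]) ** 2 + (q[1] - p[1]) ** 2

def pairs_congruent(p1, p2, p3, p4):
    """Pair (p1, p2) is congruent to pair (p3, p4)."""
    return dist2(p1, p2) == dist2(p3, p4)

def _cos_parts(p2, p1, p3):
    """Numerator and squared denominator of the cosine of angle p2 p1 p3."""
    num = (p2[0] - p1[0]) * (p3[0] - p1[0]) + (p2[1] - p1[1]) * (p3[1] - p1[1])
    return num, dist2(p1, p2) * dist2(p1, p3)

def _sign(v):
    return (v > 0) - (v < 0)

def angles_congruent(p2, p1, p3, q2, q1, q3):
    """Assignment 6: equal cosines, tested exactly as equal signs and equal squares."""
    n1, d1 = _cos_parts(p2, p1, p3)
    n2, d2 = _cos_parts(q2, q1, q3)
    return _sign(n1) == _sign(n2) and n1 * n1 * d2 == n2 * n2 * d1

# Illustrative checks with assumed sample values.
print(on((Fraction(1, 2), Fraction(5, 2)), (1, 1, -3)))
print(pairs_congruent((0, 0), (3, 4), (1, 1), (6, 1)))
print(angles_congruent((1, 0), (0, 0), (1, 1), (0, 2), (0, 0), (3, 3)))
```

In `angles_congruent` the vertex of each angle is the middle argument, matching the notation in which the angle is written as three points with the vertex in the center.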
To verify Postulate I-1 in our interpretation we must show that there
is a unique (to within a constant factor) equation in x and y of the form

$$ax + by + c = 0,$$

where a, b, c are real and a and b are not both 0, which is satisfied by two
distinct pairs of real values $(x_1, y_1)$ and $(x_2, y_2)$ of the variables x and y.
Substitution of $(x_1, y_1)$ and $(x_2, y_2)$ for $(x, y)$ in the equation

$$(y_2 - y_1)x - (x_2 - x_1)y + (x_2 y_1 - x_1 y_2) = 0 \tag{1}$$

shows that this equation is satisfied by the pairs of values $(x_1, y_1)$ and $(x_2, y_2)$.
Also, since $(x_1, y_1)$ and $(x_2, y_2)$ are distinct pairs of numbers, either $y_2 - y_1 \neq 0$
or $x_2 - x_1 \neq 0$; moreover, since $x_1, y_1, x_2, y_2$ are real, the coefficients in the
equation are real. We have thus shown that there is an equation of the
desired form satisfied by the pairs of values $(x_1, y_1)$ and $(x_2, y_2)$. We must
now show that, to within a constant nonzero factor, this is the only equation
of the desired form satisfied by the distinct pairs of values $(x_1, y_1)$ and
$(x_2, y_2)$. To this end, suppose $(x_1, y_1)$ and $(x_2, y_2)$ satisfy the equation

$$ax + by + c = 0, \tag{2}$$

where a, b, c are real and a and b are not both 0. Then we have

$$ax_1 + by_1 + c = 0 \quad\text{and}\quad ax_2 + by_2 + c = 0, \tag{3}$$

or, by subtraction,

$$a(x_2 - x_1) + b(y_2 - y_1) = 0. \tag{4}$$

Now suppose $a \neq 0$. Then equations (3) become

$$x_1 + By_1 + C = 0 \quad\text{and}\quad x_2 + By_2 + C = 0, \tag{5}$$

where B = b/a and C = c/a. Since $a \neq 0$, we cannot have $y_2 - y_1 = 0$,
for otherwise (4) would reduce to $a(x_2 - x_1) = 0$ or $x_2 - x_1 = 0$, a situation
which is impossible because the two pairs are distinct. Solving equations (5)
simultaneously for B and C, we now find

$$B = -\frac{x_2 - x_1}{y_2 - y_1}, \qquad C = \frac{x_2 y_1 - x_1 y_2}{y_2 - y_1},$$

whence

$$a : b : c = (y_2 - y_1) : -(x_2 - x_1) : (x_2 y_1 - x_1 y_2),$$

and, except for a possible constant nonzero factor, equation (2) becomes our
equation (1). A similar argument can be carried out if, instead of supposing
$a \neq 0$, we suppose $b \neq 0$. Thus Postulate I-1 becomes, in our interpretation,
a theorem of algebra.
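
The existence and uniqueness argument for Postulate I-1 can be mirrored in a few lines. This is a sketch only: lines are stored as hypothetical coefficient triples `(a, b, c)`, the sample points are arbitrary, and "same line" is tested by checking that two triples are proportional, which is the algebraic meaning of differing by a constant nonzero factor.

```python
def line_through(p1, p2):
    """Coefficients (a, b, c) of equation (1) for two distinct points."""
    (x1, y1), (x2, y2) = p1, p2
    if (x1, y1) == (x2, y2):
        raise ValueError("the two points must be distinct")
    return (y2 - y1, -(x2 - x1), x2 * y1 - x1 * y2)

def on(point, line):
    x, y = point
    a, b, c = line
    return a * x + b * y + c == 0

def same_line(l1, l2):
    """True if the triples are proportional (all 2x2 minors vanish)."""
    a1, b1, c1 = l1
    a2, b2, c2 = l2
    return a1 * b2 == a2 * b1 and a1 * c2 == a2 * c1 and b1 * c2 == b2 * c1

P1, P2 = (1, 2), (3, 5)            # assumed sample points
L = line_through(P1, P2)
print(L, on(P1, L), on(P2, L))
print(same_line(L, (6, -4, 2)))    # a nonzero multiple of L
```

With integer or `fractions.Fraction` coordinates every comparison above is exact, so the check is a faithful instance of the algebraic statement rather than an approximation.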

Postulate I-2 is easily shown to be, in our interpretation, a theorem of
algebra. 
To verify Postulate II-1 in our interpretation, we note that no matter
what real value t takes on, the point $(x, y)$ given by

$$x = (1 - t)x_1 + t x_2, \qquad y = (1 - t)y_1 + t y_2, \tag{6}$$

satisfies the equation (1) of the line determined by the points $(x_1, y_1)$ and
$(x_2, y_2)$. This shows that the points $(x, y)$, $(x_1, y_1)$, $(x_2, y_2)$ are on the same
line. Further, by setting s = 1 - t, we find

$$x = (1 - s)x_2 + s x_1, \qquad y = (1 - s)y_2 + s y_1. \tag{7}$$

Now suppose t is between 0 and 1. Then s is also between 0 and 1, and
equations (7) guarantee that point $(x, y)$ is between points $(x_2, y_2)$ and
$(x_1, y_1)$. Solving equations (6) for $x_2$ and $y_2$, we obtain

$$x_2 = \frac{1}{t}\,x + \left(1 - \frac{1}{t}\right)x_1, \qquad y_2 = \frac{1}{t}\,y + \left(1 - \frac{1}{t}\right)y_1,$$

or

$$x_2 = rx + (1 - r)x_1, \qquad y_2 = ry + (1 - r)y_1,$$

where r = 1/t is not between 0 and 1. Hence point $(x_2, y_2)$ is not between points
$(x, y)$ and $(x_1, y_1)$. We may similarly show that point $(x_1, y_1)$ is not between
points $(x, y)$ and $(x_2, y_2)$.

Postulate II-2 is readily verified by taking, in equations (6), t = 1/2 and
t = 2 in turn. In the first case we obtain a point $(x, y)$ between the points
$(x_1, y_1)$ and $(x_2, y_2)$. In the second case we obtain a point $(x, y)$ such that
point $(x_2, y_2)$ is between points $(x_1, y_1)$ and $(x, y)$.
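
The parametric reading of between lends itself to a short exact check of Postulates II-1 and II-2. The sketch below assumes rational coordinates handled with `Fraction`; `point_at`, `parameter`, and `between` are hypothetical helper names, and the two endpoints are arbitrary sample values.

```python
from fractions import Fraction

def point_at(t, p1, p2):
    """Equations (6): the point (1 - t) * p1 + t * p2."""
    (x1, y1), (x2, y2) = p1, p2
    return ((1 - t) * x1 + t * x2, (1 - t) * y1 + t * y2)

def parameter(p, p1, p2):
    """The value t with p = (1 - t) * p1 + t * p2, or None if p is off the line."""
    (x, y), (x1, y1), (x2, y2) = p, p1, p2
    dx, dy = x2 - x1, y2 - y1
    if dx == 0 and dy == 0:
        raise ValueError("the two end points must be distinct")
    if dx != 0:
        t = Fraction(x - x1) / dx
        return t if y == y1 + t * dy else None
    t = Fraction(y - y1) / dy
    return t if x == x1 else None

def between(p, p1, p2):
    """True if p corresponds to some t with 0 < t < 1."""
    t = parameter(p, p1, p2)
    return t is not None and 0 < t < 1

P1, P2 = (0, 0), (4, 2)                      # assumed sample points
mid = point_at(Fraction(1, 2), P1, P2)       # t = 1/2
far = point_at(2, P1, P2)                    # t = 2

print(between(mid, P1, P2), between(mid, P2, P1))   # symmetric in the end points
print(between(P2, mid, P1), between(P1, mid, P2))   # only one of three is between
print(between(P2, P1, far))                          # P2 lies between P1 and far
```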

Perhaps  enough  has  been  done  to  indicate  how  the  interpretations  of  the 
various  postulates  of  Hilbert’s  set  can  be  shown,  by  algebraic  methods  alone, 
to  be  theorems  of  algebra.  Some  of  the  proofs  are  very  simple,  such  as  that 
for  Postulate  III-2,  whereas  others  are  more  involved,  such  as  that  for 
Postulate II-4. We might point out that if, in the parallel postulate (Postulate
IV-1), A is the point $(x_1, y_1)$ and m is the line $ax + by + c = 0$, then the
unique line through A which does not intersect m is given by

$$ax + by - (ax_1 + by_1) = 0.$$

Also, it is easy to show that our interpretations of point, line, and on imply
a one-to-one correspondence between the points of any line and the set of
all real numbers, and this, as we have pointed out, is essentially Postulate
V-2. Actually, there are points having coordinates belonging to a suitable
subset  of  the  set  of  all  real  numbers  which  satisfy  all  the  rest  of  Hilbert’s 
postulates,  and  such  a  subset  of  the  real  numbers  would  be  sufficient  to  give 
an  algebraic  interpretation  of  these  postulates.  However,  since  we  desire 
to  deal  with  all  real  numbers,  Postulate  V-2  is  implied.  To  carry  out  the 
program  of  verifying  the  truth  of  the  interpretations  of  all  the  postulates  of 


1 


1 


or 


*2  =  I  -Jx  +  (  1  —  -  )*i, 


x2  =  rx  +  (1  —  r)x  1, 
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the  Hilbert  set  would  occupy  too  much  space,  and  so  we  shall  drop  the  matter 
at  this  point  and  invite  the  interested  reader  to  supply  the  remaining  proofs.  * 
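
One of the remaining checks, the algebraic form of the parallel postulate mentioned above, is short enough to sketch here. The code assumes lines stored as hypothetical triples `(a, b, c)` with exact integer or rational coefficients; `parallel_through` and `intersection` are illustrative names, and the sample line and point are arbitrary.

```python
from fractions import Fraction

def on(point, line):
    x, y = point
    a, b, c = line
    return a * x + b * y + c == 0

def parallel_through(point, line):
    """The line a*x + b*y - (a*x1 + b*y1) = 0 through (x1, y1)."""
    x1, y1 = point
    a, b, _ = line
    return (a, b, -(a * x1 + b * y1))

def intersection(l1, l2):
    """Common point of two lines, or None when no unique common point exists."""
    a1, b1, c1 = l1
    a2, b2, c2 = l2
    det = a1 * b2 - a2 * b1
    if det == 0:
        return None
    return (Fraction(b1 * c2 - b2 * c1, det), Fraction(a2 * c1 - a1 * c2, det))

m = (1, 2, -3)        # assumed line x + 2y - 3 = 0
A = (4, 5)            # assumed point not on m
par = parallel_through(A, m)
other = (1, -1, 1)    # a different line through A

print(par, on(A, par), intersection(par, m))
print(on(A, other), intersection(other, m))
```

Since A is not on m, the line returned by `parallel_through` differs from m in its constant term only, so a vanishing determinant here means the two lines have no common point at all.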

Let  us  now  complete  this  section  on  analytic  geometry  by  stating  briefly 
what has been accomplished by setting up our algebraic model, or interpretation,
of the Hilbert postulate set for plane Euclidean geometry.

In  the  first  place,  we  have  shown  that  plane  Euclidean  geometry  and, 
along  with  it,  plane  Lobachevskian  geometry  are  as  consistent  as  the  algebra 
of  real  numbers.  The  consistency  of  other  branches  of  mathematics  can 
similarly  be  shown  to  depend  upon  the  consistency  of  this  algebra.  It  would 
seem  important,  then,  for  us  to  take  up  a  study  of  the  real  number  system. 
In  a  later  chapter  we  shall  do  this,  and  we  shall  there  consider  a  postulational 
treatment  of  the  algebra  of  real  numbers  and  also  take  up  the  question  of 
the  consistency  of  this  algebra. 

In  the  second  place,  we  have  perhaps  clarified  the  meaning  of  analytic 
geometry.  Plane  (Cartesian)  analytic  geometry  is  not  to  be  confused  with  the 
algebraic  model  that  we  have  set  up  of  plane  Euclidean  geometry.  It  is, 
rather,  the  translation  process  whereby  we  use  the  algebraic  model  in  order 
to  solve  problems  and  establish  theorems  in  our  geometry.  Analytic  geometry, 
then,  is  not  a  branch  of  geometry,  but  rather  a  method  of  geometry.  It  amounts 
to  thinking  through  a  geometric  study  by  employing  a  different  set  of  mental 
images  in  a  different  language,  and  then  translating  back  into  the  original 
geometric  form  the  results  of  this  thinking.  Such  is  the  powerful  idea  behind 
analytic  geometry;  it  permits  us  to  employ  a  field  of  study  in  which  we  may 
be  more  facile  in  order  to  obtain  information  about  another  quite  distinct 
field  of  study  in  which  we  may  be  less  facile.  A  person  not  too  familiar  with 
a  foreign  language  resorts  to  just  such  a  method;  being  asked  a  question 
in  a  foreign  language,  he  is  likely  first  to  translate  the  question  into  his  native 
language,  think  out  an  answer  in  his  native  language,  and  then  translate 
back  into  the  foreign  language  the  answer  which  he  has  constructed.  Since 
so  many  students  are  considerably  more  able  as  algebraists  than  as  geometers, 
analytic  geometry  has  been  described  as  the  “royal  road  in  geometry”  that 
Euclid thought did not exist.†

The  idea  behind  analytic  geometry,  namely,  that  of  using  a  model  of  a 
system  in  order  to  establish  consequences  of  the  system,  can  be  strikingly 
illustrated by the Poincaré model of plane Lobachevskian geometry considered
in the previous section. Since the Poincaré model is constructed within
Euclidean  geometry,  and  since  we  are  so  much  more  familiar  with  Euclidean 
rather  than  Lobachevskian  geometry,  it  is  very  conceivable  that  some 
theorems  of  plane  Lobachevskian  geometry  may  be  more  readily  established 

*See, for example, L. P. Eisenhart, Appendix to Chap. I, pp. 279-292; also C. J.
Keyser [1], Lecture VI, pp. 86-103.

†When asked by King Ptolemy for a short cut to geometric knowledge, Euclid is
said to have replied, “There is no royal road in geometry.”
by proving the corresponding Euclidean theorems in the Poincaré model
than by proving the theorems themselves directly from a postulate set for
Lobachevskian geometry.*

Just  as  there  is  an  algebraic  model  of  plane  Euclidean  geometry,  and 
hence the method of analytic, or algebraic, geometry, so also is there a geometric
model of the algebra of real numbers, and hence a method of geometric
algebra. The ancient Greeks established a good deal of elementary algebra,
including the solution of quadratic equations, by essentially using a geometric
model in this way.†

4.5 Projective Geometry and the Principle of Duality

After  the  discovery  of  non-Euclidean  geometry,  a  handicap  to  certain 
additional  developments  in  the  field  of  geometry  was  the  general  belief 
that  mathematics  is  basically  concerned  with  magnitude.  Even  at  the  present 
time  popular  thinking  upon  mathematics  pertains  largely  to  magnitudes; 
in fact, one dictionary‡ defines mathematics as “that science which treats
of  the  exact  relations  existing  between  quantities  or  magnitudes.”  This 
point  of  view  is  somewhat  surprising,  for  even  in  antiquity  there  were 
geometrical  studies  which  considered  only  relationships  of  mutual  position 
and  order  of  figures,  and  which  did  not  depend  at  all  upon  relations  involving 
equalities  or  inequalities  of  magnitudes.  And  in  the  early  seventeenth  century 
a  small  group  of  French  mathematicians  actually  considerably  developed 
the  nonmetrical  aspects  of  geometry.  The  motivator  among  these  French 
mathematicians  was  Gerard  Desargues,  an  engineer  and  architect,  who  was 
born  in  Lyons  in  1593  and  who  died  in  the  same  city  about  1662.  Perhaps 
influenced  by  the  growing  needs  of  artists  and  architects  for  a  satisfactory 
theory  of  perspective,  Desargues  published,  in  Paris  in  1639,  a  remarkably 
original  treatise  on  the  conic  sections  which  exploited  the  nonmetrical  idea 
of  projection.  But  this  work  was  so  neglected  by  most  other  mathematicians 
of  the  time  that  it  was  soon  forgotten  and  all  copies  of  the  publication 
disappeared. Two centuries later, when the French geometer Michel Chasles
(1793-1880) wrote a history of geometry, there was no way to estimate the
value of Desargues’s work. Six years later, however, in 1845, Chasles happened
upon a manuscript copy of Desargues’s treatise, made by one of
Desargues’s  few  followers,  and  since  that  time  the  work  has  been  regarded 
as  one  of  the  classics  in  the  early  development  of  projective  geometry. 

There  are  several  reasons  for  the  initial  neglect  of  Desargues’s  little 
volume. It was overshadowed by the more supple analytic geometry introduced

*See, for example, H. S. Carslaw, Chap. 8; also H. Eves and V. E. Hoggatt.

†See, for example, H. Eves, Sections 3-6 and 3-7.

‡Webster’s Collegiate Dictionary, 5th ed.
by Descartes two years earlier. Geometers were generally either
developing  this  new  powerful  tool  or  trying  to  apply  infinitesimals  to 
geometry.  Also,  Desargues  unfortunately  adopted  a  style  of  writing  that 
was  so  eccentric  that  it  beclouded  his  work  and  discouraged  others  from 
attempting  properly  to  evaluate  his  accomplishments. 

The  reintroduction  of  nonmetrical  considerations  into  geometry  did  not 
occur  until  the  nineteenth  century,  with  the  work  in  this  field  by  Jean  Victor 
Poncelet (1788-1867). As a prisoner of war in Russia, taken during Napoleon’s
retreat from Moscow, and with no books at hand, Poncelet planned his great
work on projective geometry, which, after his release and return to France,
he published in Paris in 1822.* This work gave tremendous impetus to the
study of the subject and inaugurated the so-called “great period” in the
history of projective geometry. There followed into the field a host of French
and German mathematicians, among whom were Gergonne, Brianchon,
Chasles, Plücker, Steiner, and Staudt, all great names in the history of geometry,
and in the history of projective geometry in particular. It is interesting
to  note  that  at  approximately  the  same  time  that  non-Euclidean  geometry 
was  being  developed  it  was  first  generally  appreciated  that  geometry  could 
no  longer  be  viewed  exclusively  as  a  study  of  quantity. 

The  work  of  Desargues  and  of  Poncelet,  and  their  followers,  led 
geometers to classify geometric properties into two categories, the metric
properties, in which the measure of distances and of angles intervenes, and
the descriptive properties, in which such measure is unessential. The Pythagorean
theorem, that the square on the hypotenuse of a right triangle is equal to
the sum of the squares on the two legs, is a metric property. As an example of a
descriptive property we might mention the remarkable “mystic hexagram”
theorem of Blaise Pascal (1623-1662), which was inspired by the work of
Desargues: If a hexagon be inscribed in a conic, then the points of intersection of
the three pairs of opposite sides are collinear, and, conversely, if the points of
intersection of the three pairs of opposite sides of a hexagon are collinear, then the hexagon
is inscribed in a conic (see Figure 27).

*J. V. Poncelet.
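The direct half of Pascal’s theorem is easy to test numerically. The following is only a sketch under assumptions of our own: the conic is taken to be the unit circle, its points are produced by the usual rational parametrization, and points and lines are handled as homogeneous triples so that exact rational arithmetic can be used. None of these choices comes from the text; they merely make the check short.

```python
from fractions import Fraction

def cross(u, v):
    """Cross product of two homogeneous triples: the line on two points,
    or the point on two lines."""
    return (u[1]*v[2] - u[2]*v[1],
            u[2]*v[0] - u[0]*v[2],
            u[0]*v[1] - u[1]*v[0])

def circle_point(t):
    """Rational point of the unit circle x^2 + y^2 = 1 for a rational parameter t."""
    t = Fraction(t)
    return ((1 - t*t) / (1 + t*t), 2*t / (1 + t*t), Fraction(1))

def det3(p, q, r):
    """Triple product p . (q x r); it vanishes exactly when p, q, r are collinear."""
    c = cross(q, r)
    return p[0]*c[0] + p[1]*c[1] + p[2]*c[2]

# Hexagon 1 2 3 4 5 6 inscribed in the circle (parameters chosen arbitrarily).
p1, p2, p3, p4, p5, p6 = [circle_point(t)
                          for t in (0, Fraction(1, 2), 1, 2, -3, Fraction(-1, 2))]

# Points of intersection of the three pairs of opposite sides.
x = cross(cross(p1, p2), cross(p4, p5))
y = cross(cross(p2, p3), cross(p5, p6))
z = cross(cross(p3, p4), cross(p6, p1))

pascal_holds = det3(x, y, z) == 0
print(pascal_holds)
```

Changing the six parameters to any other distinct rational values gives another inscribed hexagon on which the same check can be run.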

The  distinction  between  the  two  types  of  geometric  properties,  at  least 
in  the  case  of  plane  figures,  becomes  clearer  when  viewed  from  the  fact  that 
the  descriptive  properties  are  unaltered  when  the  figure  is  subjected  to  a 
central  projection,  whereas  the  metric  properties  may  no  longer  hold  when 
the  figure  is  so  projected.  Thus,  under  a  central  projection  from  one  plane 
to  another,  a  right  triangle  does  not  necessarily  remain  a  right  triangle,  and 
so  the  Pythagorean  relation  does  not  necessarily  hold  for  the  projected 
figure; the Pythagorean theorem is a metric theorem. In the case of Pascal’s
theorem,  however,  a  hexagon  inscribed  in  a  conic  projects  into  a  hexagon 
inscribed  in  a  conic  and  collinear  points  project  into  collinear  points,  and 
hence  the  theorem  itself  is  preserved;  Pascal’s  theorem  is  a  descriptive 
theorem. 

Many  descriptive  properties  present  themselves  in  the  seeming  form  of 
metric properties. For example, if we have four points A, B, T, S on a line,
the value of the compound ratio, (AT/BT)/(AS/BS), would seem to be a
metric relation. The value of this compound ratio, however, can be shown to
be unaltered when the line containing the four points A, B, T, S is centrally
projected (as in Figure 28) into another line and the four points A', B',
T', S'. In other words, although the lengths of the various corresponding
segments on the two lines are not necessarily equal to one another, the two
compound ratios

(A'T'/B'T')/(A'S'/B'S') and (AT/BT)/(AS/BS)

are equal in value. The value of the compound ratio (AT/BT)/(AS/BS) is
the so-called cross ratio (AB, TS) of the four collinear points A, B, T, S
(in this order), which we considered in Section 4.3 in connection with our
study of the Poincaré model. The cross ratio of four collinear points is a
descriptive property of those points.

FIG. 28
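The invariance of the cross ratio under central projection can be illustrated with a small exact computation. This is a sketch only, under assumptions that are ours and not the text’s: the original line is taken to be the x axis, the center of projection and the second line are chosen arbitrarily, and the segments AT, BT, AS, BS are read as signed lengths.

```python
from fractions import Fraction

def cross_ratio(a, b, t, s):
    """(AB, TS) = (AT/BT)/(AS/BS), with signed lengths AT = t - a, and so on."""
    return ((t - a) / (t - b)) / ((s - a) / (s - b))

def project(x, center, base, direction):
    """Centrally project the point (x, 0) of the x axis from `center` onto the
    line base + u*direction; return the coordinate u of the image on that line."""
    wx, wy = x - center[0], -center[1]                 # vector from center to (x, 0)
    rx, ry = base[0] - center[0], base[1] - center[1]  # vector from center to base
    det = direction[0]*wy - wx*direction[1]            # zero only for a ray parallel to the target line
    return (wx*ry - wy*rx) / det

# Hypothetical data: four points A, B, T, S on the x axis.
points = [Fraction(v) for v in (0, 1, 3, 7)]
center, base, direction = (1, 4), (2, 1), (1, 1)

images = [project(x, center, base, direction) for x in points]

print(cross_ratio(*points), cross_ratio(*images))   # the two compound ratios
print(points[2] - points[0], images[2] - images[0]) # AT and A'T' need not agree
```

The coordinate u is measured in units of the length of the direction vector, which does not matter here because the cross ratio is a ratio of ratios.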

The study of the descriptive properties of geometric figures is known
as projective geometry.

In  plane  projective  geometry,  there  is  a  remarkable  symmetry  between 
points  and  lines  (when  ideal  elements  at  infinity  are  utilized),  such  that  if 
in  a  true  descriptive  proposition  about  “points”  and  “lines”  we  should 
interchange  these  two  words,  and  perhaps  smooth  out  the  language,  we 
obtain  another  true  descriptive  proposition  about  “lines”  and  “points.” 
As  a  simple  example,  consider  the  following  two  propositions  related  in 
this  way: 

Any  two  distinct  points  determine  one  and  only  one  line  on  which  they  both  lie. 

Any  two  distinct  lines  determine  one  and  only  one  point  through  which  they  both 
pass. 

This  symmetry,  which  results  in  the  pairing  of  the  propositions  of  plane 
projective  geometry,  is  a  principle  of  far-reaching  consequence  known  as 
the principle of duality, and was perhaps first clearly stated by Joseph-Diez
Gergonne  (1771-1859).  Once  the  principle  of  duality  is  established,  then  the 
proof  of  one  proposition  of  a  dual  pair  carries  with  it  the  proof  of  the  other. 

Let  us  dualize  Pascal’s  theorem.  We  first  restate  Pascal’s  theorem  in 
a  form  which  is  perhaps  more  easily  dualized. 

The  six  points  1,  2,  3,  4,  5,  6  lie  on  a  conic  if  and  only  if  the  points  determined 
by  the  three  pairs  of  lines  (12),  (45);  (23),  (56);  (34),  (61)  lie  on  a  line. 

Dualizing  this,  we  obtain 

The  six  lines  1,  2,  3,  4,  5,  6  lie  on  (that  is,  are  tangent  to)  a  conic  if  and 
only  if  the  lines  determined  by  the  three  pairs  of  points  (12),  (45);  (23),  (56); 
(34),  (61)  lie  on  (that  is,  intersect  in)  a  point. 

Using  less  artificial  language,  Pascal’s  theorem  and  its  dual  may  now  be 
stated  as 

A  hexagon  is  inscribed  in  a  conic  if  and  only  if  the  points  of  intersection  of  the 
three  pairs  of  opposite  sides  are  collinear. 

A  hexagon  is  circumscribed  about  a  conic  if  and  only  if  the  lines  joining  the  three 
pairs  of  opposite  vertices  are  concurrent. 

The dual of Pascal’s theorem was first published by C. J. Brianchon (1785-1864),
when a student at the École Polytechnique in Paris, in 1806, nearly
two hundred years after Pascal had stated his theorem. Brianchon’s theorem
is illustrated in Figure 29.

Poncelet maintained that the principle of duality is a consequence of
the theory of poles and polars, which, though known earlier, was first developed
by him in a systematic manner. Toward explaining this theory, let T (see
Figure 30) be a given conic. To each point P in the plane of T can be made
to correspond a line p in the plane of T in the following way. If P is on T,
take the tangent to T at P as the associated line p. If P is not on T, draw
through P two lines cutting T in the points A and A', B and B', respectively.
Then the line p is the line determined by the point of intersection of AB
and A'B' and the point of intersection of AB' and A'B. It can be shown that
to a point P there corresponds, in this way, a unique line p, and conversely,
to a line p there corresponds a unique point P. The point P is called the pole
of the line p, and the line p is called the polar of the point P.
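The construction of the polar just described can be tried out numerically. The sketch below rests on assumptions of our own: the conic T is taken to be the unit circle, points and lines are written as homogeneous triples, and the result is compared with the familiar analytic fact (not stated in the text) that the polar of the point (x0, y0) with respect to the unit circle is the line x0 x + y0 y = 1.

```python
from fractions import Fraction

def cross(u, v):
    """Cross product of two homogeneous triples: the line on two points,
    or the point on two lines."""
    return (u[1]*v[2] - u[2]*v[1],
            u[2]*v[0] - u[0]*v[2],
            u[0]*v[1] - u[1]*v[0])

def circle_point(t):
    """Rational point of the unit circle x^2 + y^2 = 1 for a rational parameter t."""
    t = Fraction(t)
    return ((1 - t*t) / (1 + t*t), 2*t / (1 + t*t), Fraction(1))

# Two secants, cutting the circle in A, A' and in B, B' (parameters are arbitrary).
A, A1 = circle_point(0), circle_point(2)
B, B1 = circle_point(Fraction(1, 2)), circle_point(-3)

P = cross(cross(A, A1), cross(B, B1))   # the point P through which both secants pass
X = cross(cross(A, B), cross(A1, B1))   # intersection of AB and A'B'
Y = cross(cross(A, B1), cross(A1, B))   # intersection of AB' and A'B
polar_constructed = cross(X, Y)         # the line p determined by X and Y

# Analytic polar of P = (px, py, pw): px*x + py*y - pw*w = 0.
polar_formula = (P[0], P[1], -P[2])

# Two line triples name the same line exactly when they are proportional.
agree = cross(polar_constructed, polar_formula) == (0, 0, 0)
print(agree)
```

Here the secants are fixed first and P is obtained as their intersection, which is equivalent to drawing two lines through a given P but keeps every coordinate rational.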




FIG.  30 

This  correspondence  between  the  points  P  and  the  lines  p  of  the  plane 
of  T  possesses  some  important  properties.  First  of  all,  the  correspondence  is 
invariant  under  projection.  Further,  if  p  is  the  polar  of  a  point  P,  the  polar 
of  any  point  P'  on  p  passes  through  the  point  P.  It  follows  that  if  some  points 
are collinear, then the polars of these points are concurrent.

Now imagine that we have established a projective property about a
plane figure F composed of points and lines. Let F' be the figure which one
obtains by replacing the points and lines of F by their polars and poles with
respect to a given conic T lying in the plane of F. Then one will obtain from
the projective property of figure F a corresponding projective property of
figure F', in which the roles played by the words “point” and “line” have
been interchanged. The two properties will be the duals of each other.

The  preceding  account  of  plane  projective  geometry  and  of  its  principle 
of  duality  is  that  of  the  earlier  days  of  the  subject’s  development.  It  must 
be  confessed  that  there  is  much  wanting  in  this  early  treatment.  For  example, 
the definition of plane projective geometry, involving, as it does, the distinction
between metric properties and descriptive properties, lacks a desired
precision  and  makes  projective  geometry  in  some  ways  subservient  to 
Euclidean  geometry.  Also,  one  feels  that  the  principle  of  duality  needs  a 
more  general  handling  than  is  given  by  Poncelet’s  treatment,  and  there  are 
objections  to  the  use  of  ideal  elements  at  infinity.  These  faults  have  been 
remedied  in  more  recent  times,  and  we  now  have  at  least  two  quite  acceptable 
and  self-contained  approaches  to  the  subject.  One  of  these  approaches  is 
based  upon  analytic  geometry,  and  will  be  indicated  in  the  problems  at  the 
end  of  the  chapter.  For  the  present  we  shall  consider  briefly  a  very  satisfying 
approach  based  upon  the  modern  postulational  method.  Our  procedure  will 
be  to  put  down  a  set  of  postulates,  and  then  simply  define  plane  projective 
geometry  as  the  deductive  consequences  of  the  system  of  postulates.  If  these 
postulates,  which  will  be  statements  about  certain  primitive  entities  called 
“points”  and  “lines,”  can  be  shown  to  imply  the  dual  of  each  postulate, 
then  the  principle  of  duality  for  plane  projective  geometry  follows,  because 
the  dual  of  any  theorem  that  has  been  derived  from  the  postulate  set  may 
be  established  by  simply  dualizing  the  steps  in  the  proof  of  the  original 
theorem. 

There  are  many  postulate  sets  for  plane  projective  geometry  that  we 
could  give,  but  the  following,  *  in  which  each  postulate  is  self-dual,  most 
neatly  carries  out  our  present  purpose.  Here  “point,”  “line,”  and  “on”  are 
primitive  terms. 

I. There is one and only one line on every two distinct points, and one and only one
point on every two distinct lines.

II. There exist two points and two lines such that each of the points is on just one
of the lines and each of the lines is on just one of the points.

III. There exist two points and two lines, the points not on the lines, such that the
point on the two lines is on the line on the two points.

*See K. Menger [3], and H. S. M. Coxeter [2], p. 14.

The fact that each postulate is self-dual saves us the task of deriving as
theorems the system of dual statements, and automatically vouchsafes us
the principle of duality for plane projective geometry. The idea of a principle
of duality and of a self-dual postulate set is not peculiar to projective geometry;
we shall meet this situation again, in our study of Boolean algebra.
The aspect of a principle of duality which is of concern to us is that the
proofs of two dual propositions are precisely the same when proper changes
are made in those terms which possess duals. In other words, the deductive
organizations  set  up  in  proving  dual  propositions  are  essentially  the  same 
if  we  ignore  the  superficial  difference  in  language.  Thus,  if  the  dual  terms 
are  replaced  by  undefined  symbols,  it  is  immediately  apparent  that  the 
original  system  of  propositions  and  the  dual  system  are  identical.  In  the 
concept  of  duality,  therefore,  we  find  the  idea  reinforced  that  mathematical 
studies  pertain  essentially  to  the  treatment  of  symbolic  structures,  and  the 
study  of  abstract  relations  between  symbols  is  perhaps  more  important  to 
mathematics  than  the  consideration  of  possible  meanings  which  may  be 
associated  with  the  symbols. 

It  is  a  significant  fact  that  the  postulates  for  plane  projective  geometry 
given  above  do  not  require  that  the  number  of  points  in  our  geometry  be 
infinite. In fact, consider the situation where there are seven “points,”
namely, the letters A, B, C, D, E, F, G, and seven “lines,” namely, the letter
trios (AFB), (BDC), (CEA), (AGD), (BGE), (CGF), (DEF). Postulate I is
easily verified by considering each pair of “points” and each pair of “lines.”
Postulate II is verified by the two “points” B, C and the two “lines” (AFB),
(CEA). Postulate III is verified by the two “points” B, C and the two “lines”
(AGD), (DEF).
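The verification of the three postulates in the seven-point model is purely mechanical, and can be sketched as a short exhaustive check. The encoding below is an assumption of ours: a “line” is represented as the set of its three letters, “on” is read as set membership, and the function names are invented for the illustration.

```python
from itertools import combinations

points = "ABCDEFG"
lines = [frozenset(s) for s in ("AFB", "BDC", "CEA", "AGD", "BGE", "CGF", "DEF")]

def postulate_I():
    """One and only one line on every two points, and one point on every two lines."""
    one_line = all(sum(1 for k in lines if p in k and q in k) == 1
                   for p, q in combinations(points, 2))
    one_point = all(len(l & m) == 1 for l, m in combinations(lines, 2))
    return one_line and one_point

def witnesses_II(p, q, l, m):
    """Each of the points is on just one of the lines, and each of the lines
    is on just one of the points."""
    return ((p in l) != (p in m) and (q in l) != (q in m)
            and (p in l) != (q in l) and (p in m) != (q in m))

def witnesses_III(p, q, l, m):
    """The points are not on the lines, and the point on the two lines
    is on the line on the two points."""
    if any(x in k for x in (p, q) for k in (l, m)):
        return False
    meet = l & m
    join = [k for k in lines if p in k and q in k]
    return len(meet) == 1 and len(join) == 1 and meet <= join[0]

print(postulate_I())
print(witnesses_II("B", "C", frozenset("AFB"), frozenset("CEA")))
print(witnesses_III("B", "C", frozenset("AGD"), frozenset("DEF")))
```

Because Postulates II and III are existence statements, a single pair of points and pair of lines passing the corresponding check is enough to establish each of them in the model.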

A  projective  geometry  containing  only  a  finite  number  of  distinct  points 
is  called  a  finite  projective  geometry.  Our  model  establishes  the  existence  of  a 
finite  plane  projective  geometry.  It  also  shows  that  our  postulates  are 
consistent  with  one  another,  for  an  inconsistency  among  the  postulates  would 
render  a  concrete  model  of  them  impossible.  Again,  the  model  furnishes 
additional  evidence  for  the  argument  that  a  mathematical  system  deduced 
from  a  set  of  postulates  is  independent  of  any  meanings  which  may  be 
associated  with  the  primitive  terms  employed  in  the  postulates;  the  same 
system  may  be  capable  of  a  great  variety  of  interpretations.  And,  finally, 
it  shows  that  if  we  wish  to  assure  that  our  plane  projective  geometry  resemble 
more  closely  the  earlier  historical  concept  of  the  subject  we  shall  have  to 
introduce  further  postulates.  In  fact,  by  gradually  adding  appropriate  further 
postulates  and  introducing  certain  modifications  it  is  possible  to  convert  the 
postulate  set  for  plane  projective  geometry  into  a  postulate  set  for  plane 
Euclidean  geometry,  passing  through  postulate  sets  for  various  intermediate 
geometries  on  the  way.  Such  a  passage  from  projective  geometry  to  Euclidean 
geometry is an interesting but lengthy procedure which we must forego.*
We also shall forego commenting here on solid projective geometry and its
principle of duality, and pass now to a chapter devoted to the important
subject of algebraic structure.

*For such a program see, for example, G. de B. Robinson, or B. E. Meserve [2].


PROBLEMS

4.1.1  (a)  Consider  the  following  definitions  taken  from  an  elementary 
geometry  text:  (1)  The  diagonals  of  a  quadrilateral  are  the  two  straight  line 
segments  joining  the  two  pairs  of  opposite  vertices  of  the  quadrilateral. 
(2)  Parallel  lines  are  straight  lines  that  lie  in  the  same  plane  and  that  never 
meet,  however  far  they  are  extended  in  either  direction.  (3)  A  parallelogram 
is  a  quadrilateral  having  its  opposite  sides  parallel. 

Now,  without  using  any  of  the  italicized  words  above,  restate  the 
proposition,  “The  diagonals  of  a  parallelogram  bisect  each  other.” 

(b) In elementary algebra, if n represents a positive integer and if k
represents any real number, we define k^n as a symbol to represent the
product (k)(k) ··· (k), in which k appears as a factor n times. Rewrite, without
using exponents, the following expression in which a, b, c represent real
numbers:

[(a + b)b(a - c)^3]^7.

(c)  By  means  of  appropriate  explicit  definitions  reduce  the  following 
sentence  to  one  containing  not  more  than  seven  words:  “The  movable  seats 
with  four  legs  and  a  back  were  restored  to  a  sound  state  by  the  person  who 
takes  care  of  the  building.” 

These  exercises  illustrate  the  convenience  of  explicit  definitions. 

4.1.2 Give the customary definitions of the following mathematical symbols
and illustrate their convenience: (a) Σ_{n=1}^{∞} a_n, (b) n!, n a positive integer,
(c) m and n positive integers with m ≤ n.

4.1.3  Trace  the  following  words  through  a  standard  dictionary  until  a 
circular  chain  has  been  established:  (a)  dead,  (b)  noisy,  (c)  line  (in  the 
mathematical  sense). 

4.1.4  Answer  the  following  questions  intuitively,  and  then  check  your 
answers  by  calculation: 

(a)  A  car  travels  from  P  to  Q  at  the  rate  of  40  miles  per  hour  and  then 
returns  from  Q  to  P  at  the  rate  of  60  miles  per  hour.  What  is  the  average 
rate  for  the  round  trip? 

(b)  A  can  do  a  job  in  4  days,  and  B  can  do  it  in  6  days.  How  long 
will  it  take  A  and  B  together  to  do  the  job? 

(c)  A  man  sells  half  of  his  apples  at  3  for  17  cents  and  then  sells  the 
other  half  at  5  for  17  cents.  At  what  rate  should  he  sell  all  of  his  apples 
in order to make the same income?

(d) If a ball of yarn 4 inches in diameter costs 20 cents, how much
should  you  pay  for  a  ball  of  yarn  6  inches  in  diameter? 

(e)  Two  jobs  have  the  same  starting  salary  of  $6000  per  year  and  the 
same  maximum  salary  of  $12,000  per  year.  One  job  offers  an  annual  raise 
of  $800  and  the  other  offers  a  semiannual  raise  of  $200.  Which  is  the  better 
paying  job? 

(f)  Each  bacterium  in  a  certain  culture  divides  into  two  bacteria  once 
a  minute.  If  there  are  20  million  bacteria  present  at  the  end  of  one  hour, 
when  were  there  exactly  10  million  bacteria  present? 

(g) Is a salary of 1 cent for the first half month, 2 cents for the second
half  month,  4  cents  for  the  third  half  month,  8  cents  for  the  fourth  half 
month,  and  so  on  until  the  year  is  used  up,  a  good  or  a  poor  total  salary 
for  the  year? 

(h)  A  clock  strikes  six  in  5  seconds.  How  long  will  it  take  to  strike 
twelve? 

(i)  A  bottle  and  a  cork  together  cost  $1.10.  If  the  bottle  costs  a  dollar 
more  than  the  cork,  how  much  does  the  cork  cost? 

(j)  Suppose  that  in  one  glass  there  is  a  certain  quantity  of  a  liquid  A, 
and  in  a  second  glass  an  equal  quantity  of  another  liquid  B.  A  spoonful 
of  liquid  A  is  taken  from  the  first  glass  and  put  into  the  second  glass,  then 
a  spoonful  of  the  mixture  from  the  second  glass  is  put  back  into  the  first 
glass.  Is  there  now  more  or  less  liquid  A  in  the  second  glass  than  there  is 
liquid  B  in  the  first  glass? 

(k)  Suppose  that  a  large  sheet  of  paper  one  one-thousandth  of  an  inch 
thick  is  torn  in  half  and  the  two  pieces  put  together,  one  on  top  of  the  other. 
These  are  then  torn  in  half,  and  the  four  pieces  put  together  in  a  pile.  If 
this  process  of  tearing  in  half  and  piling  is  done  50  times,  will  the  final  pile 
of  paper  be  more  or  less  than  a  mile  high? 

(l)  Is  a  discount  of  15  per  cent  on  the  selling  price  of  an  article  the 
same  as  a  discount  of  10  per  cent  on  the  selling  price  followed  by  a  discount 
of  5  per  cent  on  the  reduced  price? 

(m)  Four  fourths  exceeds  three  fourths  by  what  fractional  part? 

(n)  A  boy  wants  the  arithmetical  average  of  his  eight  grades.  He 
averages  the  first  four  grades,  then  the  last  four  grades,  and  then  finds  the 
average  of  these  averages.  Is  this  correct? 

These exercises illustrate the danger of using intuition in arriving at
results.

4.1.5  In  each  of  the  following,  is  the  given  conclusion  a  valid  deduction 
from  the  given  pair  of  premises? 

(a)  If  today  is  Saturday,  then  tomorrow  will  be  Sunday. 

But  tomorrow  will  be  Sunday. 

Therefore,  today  is  Saturday. 

(b)  Germans  are  heavy  drinkers. 

Germans  are  Europeans. 

Therefore, Europeans are heavy drinkers.

(c) If a is b, then c is d.

But  c  is  d. 

Therefore,  a  is  b. 

(d)  All  a’s  are  b’s. 

All  a’s  are  c’s. 

Therefore,  all  c’s  are  b’s. 

These  exercises  illustrate  how  a  person  may  allow  the  meanings  which 
he  associates  with  words  or  expressions  to  dominate  his  logical  analysis. 
There  is  a  greater  tendency  to  go  wrong  in  (a)  and  (b)  than  in  (c)  and  (d), 
which  are  symbolic  counterparts  of  (a)  and  (b). 

4.1.6  We  repeat  here  the  postulate  set  of  Problem  3.5.8. 

P1: Every aba is a collection of dabas.

P2:  There  exist  at  least  two  dabas. 

P3:  If  p  and  q  are  two  dabas,  then  there  exists  one  and  only  one  aba 
containing  both  p  and  q. 

P4:  If  L  is  an  aba,  then  there  exists  a  daba  not  in  L. 

P5: If L is an aba, and p is a daba not in L, then there exists one and
only one aba containing p and not containing any daba that is in L.

(a)  What  are  the  primitive  terms  in  this  postulate  set? 

(b)  Deduce  the  following  theorems  from  this  postulate  set:  (1)  Every 
daba  is  contained  in  at  least  two  abas.  (2)  Every  aba  contains  at  least  two 
dabas.  (3)  There  exist  at  least  four  distinct  dabas.  (4)  There  exist  at  least 
six  distinct  abas. 

(c)  Define  a  kurple  as  any  three  dabas  not  contained  in  the  same  aba. 
What  is  a  kurple  if  we  interpret  daba  as  “point”  and  aba  as  “straight  line?” 

4.1.7  (a)  Establish  the  following  consequences  of  the  first  five  postulates  of 
Pieri’s  postulate  set  for  Euclidean  geometry  (as  given  in  Section  4.1).  (1)  If 
C and D are two distinct points of the straight line AB, then A and B are
points  of  the  straight  line  CD.  (2)  If  three  points  are  on  a  straight  line,  then 
the  three  points  corresponding  to  them  in  any  motion  are  also  on  a  straight 
line. 

(b)  Pieri  defines  a  sphere  as  follows:  “If  A  and  B  are  two  distinct  points, 
then  the  aggregate  of  all  points  P  such  that  for  each  P  there  exists  a  motion 
which  leaves  A  fixed  but  makes  P  correspond  to  B  is  called  the  sphere  of 
center  A  passing  through  B.”  Establish  the  following  consequences  of  this 
definition  and  Pieri’s  first  five  postulates:  (1)  A  sphere  transforms  into  a 
sphere  in  every  motion.  (2)  A  motion  which  leaves  the  center  of  a  sphere 
fixed  transforms  the  sphere  into  itself.  (3)  If  two  spheres  with  centers  A  and  B 
have  only  one  point  C  in  common,  then  the  three  points  A,  B,  C  lie  on  a  line. 

(c) Try to formulate, in terms of motion, a suitable definition of
perpendicularity.

4.2.1 Prove the theorem that follows Postulate III-3 in Hilbert’s postulate
set  for  plane  Euclidean  geometry. 

4.2.2 Deduce the following theorems from Hilbert’s postulate set for plane
Euclidean geometry:

(a) If point B is between points A and D, and point C is between B
and D, then C is between A and D.

(b) There is no limit to the number of distinct points between two
given distinct points.

(c)  If  neither  of  two  distinct  lines,  a  and  b,  intersects  a  third  line  r, 
then  a  and  b  do  not  intersect. 

(d)  If  two  sides  and  the  included  angle  of  one  triangle  are  congruent, 
respectively,  to  two  sides  and  the  included  angle  of  another  triangle,  then 
all  the  parts  (angles  and  sides)  of  the  first  triangle  are  congruent  to  the 
corresponding  parts  of  the  second  triangle. 

(e)  If  two  angles  and  the  included  side  of  one  triangle  are  congruent, 
respectively,  to  two  angles  and  the  included  side  of  another  triangle,  then 
all  the  parts  (angles  and  sides)  of  the  first  triangle  are  congruent  to  the 
corresponding  parts  of  the  second  triangle. 

4.2.3  Try  to  deduce  the  following  proposition  from  Hilbert’s  postulate  set 
for  plane  Euclidean  geometry:  Given  any  four  points  on  a  line,  it  is  always 
possible to denote them by letters A, B, C, D in such a way that B is between
A  and  C  and  also  between  A  and  D,  and  that  C  is  between  A  and  D  and  also 
between  B  and  D. 

This  proposition  was  included  as  a  postulate  in  the  first  edition  of 
Hilbert’s  work,  but  was  later  proved  by  E.  H.  Moore  to  be  a  consequence 
of  Hilbert’s  other  postulates;  that  is,  in  the  technical  jargon  of  mathematics, 
the  proposition  given  above  is  not  independent  of  the  remaining  propositions 
listed  by  Hilbert  as  postulates.  * 

4.2.4 (a) Consider the configuration formed by the positive x axis and a
directed circular arc C of radius r radiating from the origin O and such that
the arc C is convex when viewed from its right-hand side. We shall call
such a configuration a (special kind of) horn angle, and denote it by h. Let T
be the directed tangent to C at O, and designate the positive (counterclockwise)
angle from the positive x axis around to T by θ. We shall compare
two such horn angles h and h' in the following way. If θ = θ' and r = r',
then we say h = h'; if θ > θ', we say h > h'; if θ = θ' but r < r', then
again we say h > h'. We further say h' = nh, where n is a positive integer,
if and only if θ' = nθ and r' = r/n. Show that our horn angles now form
a non-Archimedean system of entities; that is, show that there exist horn
angles h and h' such that nh < h' for every positive integer n.

(b) Show that a horn angle for which θ = 0 may be trisected with
straightedge and compasses.

(c) Consider pairs of power series of the forms y = a_1 x + a_2 x^2 + a_3 x^3
+ ··· and y' = a'_1 x + a'_2 x^2 + a'_3 x^3 + ···, where the coefficients are real
numbers. We shall compare two such power series as follows. We say y = y'
if and only if a_i = a'_i for all i; we say y > y' if and only if there exists some
positive integer k such that a_1 = a'_1, a_2 = a'_2, ···, a_{k-1} = a'_{k-1}, a_k > a'_k.
We further say that y' = ny, where n is a positive integer, if and only if
a'_i = n a_i for all i. Show that the set of all such power series form, under the
above definitions, a non-Archimedean system of entities. [This can be interpreted
as a generalization of part (a), where the circular arcs C have been
replaced by analytic curves passing through O.]

*See E. H. Moore [1].

(d) Consider a set of entities of which a typical member M is composed
of the segment -a ≤ x ≤ 0 (a > 0), and the isolated points x = 1, 2, ···, k.
Devise a method of comparing such entities, and define nM, where n is a
positive integer, such that the entities form a non-Archimedean system.

(e) Let z = a + ib and z' = a' + ib' (a, b, a', b' real and i = √-1)
be two complex numbers. Set z = z' if and only if a = a' and b = b'. If
a > a', set z > z'; if a = a' but b > b', again set z > z'. Define nz =
na + i(nb). Show that the complex numbers, under the above definitions,
form a non-Archimedean system of entities.

(f) Show that the set of all coplanar vectors radiating from a point O
can be formed into a non-Archimedean system.

4.2.5  It  is  pointed  out  in  Section  4.2  that  Huntington  has  given  a  postulate 
set  for  Euclidean  geometry  of  space  in  which  sphere  is  taken  as  a  primitive 
element  and  inclusion  as  a  primitive  relation.  Try  to  formulate,  in  terms  of 
these primitive terms, suitable definitions for point, segment, ray, and line.

4.3.1 In Figure 23, show that lim_{a→r} AB = ∞.

4.3.2 Let Π be a fixed circle of center O and radius p, and let P be any
point in the plane of Π. Then the point P' on the ray OP such that OP · OP'
= p^2 is called the inverse of P with respect to circle Π. Circle Π is called the
circle of inversion, point O the center of inversion, and p^2 the power of inversion.
There is set up a one-to-one correspondence between the points of the plane
of Π; to every point, except O, there is a corresponding point. The points
of a curve C will invert into the points of a curve C', called the inverse of C.
Establish the following theorems concerning this inversion transformation.

(a)  If  P'  is  the  inverse  of  P,  then  P  is  the  inverse  of  P'. 

(b)  A  point  inside  the  circle  of  inversion  inverts  into  a  point  outside 
the  circle  of  inversion;  a  point  outside  the  circle  of  inversion  inverts  into  a 
point  inside  the  circle  of  inversion;  a  point  on  the  circle  of  inversion  inverts 
into  itself. 

(c)  A  straight  line  through  the  center  of  inversion  inverts  into  itself. 

(d)  A  circle  orthogonal  to  the  circle  of  inversion  inverts  into  itself. 

(e) If P, P' and Q, Q' are any two pairs of inverse points which do not
lie on the same diameter of the circle of inversion, then these four points lie
on a circle and ∠OPQ = ∠OQ'P' and ∠OQP = ∠OP'Q'.

(f)  A  straight  line  which  does  not  pass  through  the  center  of  inversion 
inverts  into  a  circle  passing  through  the  center  of  inversion,  and  conversely. 

(g)  A  circle  which  does  not  pass  through  the  center  of  inversion  inverts 
into  a  circle  which  does  not  pass  through  the  center  of  inversion. 

(h)  Any  circle  through  a  pair  of  inverse  points  P  and  P'  cuts  the  circle 
of  inversion  orthogonally. 

(i)  A  given  circle  may  be  inverted  into  itself  by  the  use  of  any  given 
exterior  point  as  center  of  inversion. 
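
The definition in Problem 4.3.2 can be explored numerically before the proofs are attempted. The following is a minimal Python sketch, assuming Cartesian coordinates; the function names `invert` and `distance_from_origin` and the sample point are illustrative choices that do not appear in the text. It checks parts (a) and (b) for one particular point, which illustrates the statements but of course does not prove them.

```python
import math


def invert(point, center=(0.0, 0.0), radius=1.0):
    """Return the inverse P' of `point` in the circle with the given center and radius.

    P' lies on the ray from the center O through P, with OP * OP' = radius**2.
    """
    dx, dy = point[0] - center[0], point[1] - center[1]
    dist_sq = dx * dx + dy * dy
    if dist_sq == 0:
        raise ValueError("the center of inversion has no inverse point")
    scale = radius * radius / dist_sq  # equals OP' / OP
    return (center[0] + scale * dx, center[1] + scale * dy)


def distance_from_origin(point):
    """Euclidean distance of `point` from (0, 0)."""
    return math.hypot(point[0], point[1])


P = (0.3, 0.4)        # OP = 0.5, so P is inside the unit circle
P_inv = invert(P)     # OP' should be 1 / 0.5 = 2, outside the circle
print(distance_from_origin(P) * distance_from_origin(P_inv))  # approximately 1
print(invert(P_inv))  # approximately (0.3, 0.4), as part (a) asserts
```

Floating-point arithmetic is used here, so equalities hold only approximately; comparisons should be made with a tolerance such as `math.isclose`.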

4.3.3 Prove that in an inversion the angle between two intersecting curves
is  equal  to  the  corresponding  angle  between  the  two  inverse  curves. 

A  transformation  which  preserves  angles  between  curves  is  called  a 
conformal  transformation.  More  briefly  stated,  then,  the  theorem  concerned 
says  that  “inversion  is  a  conformal  transformation.” 

4.3.4  (a)  Show  that  the  cross  ratio  of  four  points  on  a  circle  is  invariant 
under  inversion. 

(b) Let P, Q be two points in the Poincaré model, and let P', Q' be
their inverses with respect to some line. Show that P'Q' = PQ.

4.3.5 Let ABC be any triangle in the Poincaré model such that C is not
at the center of Σ. Show that there is a suitable line m such that inversion
with respect to circle m will carry Σ into itself and triangle ABC into a
congruent triangle A'B'C', where C' is at the center O of Σ and A'C',
B'C' are straight line segments.

4.3.6 Show that if, in triangles ABC, A'B'C' of the Poincaré model,
AB = A'B', AC = A'C', ∠A = ∠A', then ∠B = ∠B'. (This verifies
Postulate III-6 in the Poincaré model.)

4.4.1  Prove  the  following  theorems  connected  with  the  algebraic  model  of 
Section  4.4: 

(a)  If  a  point  satisfies  an  equation  of  a  line,  then  it  satisfies  every 
equation  of  the  line. 

(b) The pair of points $(x_1, y_1)$, $(x_2, y_2)$ is congruent to the pair of points
$(x_2, y_2)$, $(x_1, y_1)$.

(c) If t is between 0 and 1, then 1/t is not between 0 and 1.

(d) If $(x_1, y_1)$ and $(x_2, y_2)$ are distinct points, and if

$x = (1 - t)x_1 + t x_2, \quad y = (1 - t)y_1 + t y_2,$

and also

$x = (1 - r)x_1 + r x_2, \quad y = (1 - r)y_1 + r y_2,$

then r = t.

(e) If in (4) of Section 4.4 we take t = 1/2, then the distance between
$(x_1, y_1)$ and (x, y) is equal to the distance between (x, y) and $(x_2, y_2)$. [The
point (x, y) is called the midpoint between $(x_1, y_1)$ and $(x_2, y_2)$.]

(f) The lines $ax + by + c_1 = 0$ and $ax + by + c_2 = 0$, where $c_1 \ne c_2$,
do not intersect.

4.4.2 There is a very useful inequality, often referred to as Schwarz's
inequality, which asserts that if $a_1, a_2, b_1, b_2$ are any four real numbers then

$(a_1 b_1 + a_2 b_2)^2 \le (a_1^2 + a_2^2)(b_1^2 + b_2^2).$

Using  Schwarz’s  inequality,  establish  the  following  inequalities  connected 
with  the  algebraic  model  of  Section  4.4: 

(a) The cosine of angle $(x_2, y_2)(x_1, y_1)(x_3, y_3)$ is numerically less than
or equal to 1.

(b) For any three points $P_1 : (x_1, y_1)$, $P_2 : (x_2, y_2)$, $P_3 : (x_3, y_3)$,

distance $P_1P_3 \le$ distance $P_1P_2$ + distance $P_2P_3$.
(This is referred to as the triangle inequality.)

4.4.3 Verify Postulate II-3 in the algebraic model of Section 4.4.

4.4.4 Verify Postulate III-1 in the algebraic model of Section 4.4.

4.4.5 Verify Postulate III-2 in the algebraic model of Section 4.4.

4.4.6 Verify Postulate III-3 in the algebraic model of Section 4.4.

4.4.7 Verify Postulate III-5 in the algebraic model of Section 4.4.

4.4.8 Verify Postulate III-6 in the algebraic model of Section 4.4.

4.4.9 Verify Postulate V-1 in the algebraic model of Section 4.4.

4.4.10 Verify Postulates II-4 and III-4 in the algebraic model of Section
4.4.  These  two  postulates  are  somewhat  more  difficult  to  establish  than  are 
the  other  postulates.  See,  for  example,  L.  P.  Eisenhart,  Coordinate  Geometry 
(Boston:  Ginn  and  Company,  1939),  Appendix  to  Chap.  1. 

4.4.11 Prove, by using the Poincaré model, that in plane Lobachevskian
geometry  the  sum  of  the  angles  of  a  triangle  is  always  less  than  two  right 
angles. 

4.5.1  The  consequences  of  Pascal’s  “mystic  hexagram”  theorem  are  very 
numerous  and  attractive,  and  an  almost  unbelievable  amount  of  research 
has  been  expended  on  the  configuration.  There  are  sixty  possible  ways  of 
forming  a  hexagon  from  6  points  on  a  conic,  and,  by  Pascal’s  theorem,  to 
each  hexagon  corresponds  a  Pascal  line.  These  60  Pascal  lines  pass  three  by 
three  through  20  points,  called  Steiner  points ,  which  in  turn  lie  four  by  four 
on 15 lines, called Plücker lines. The Pascal lines also concur three by three
in another set of points, called Kirkman points, of which there are 60.
Corresponding to each Steiner point there are three Kirkman points such that
all  four  lie  upon  a  line,  called  a  Cayley  line.  There  are  20  of  these  Cayley 
lines,  and  they  pass  four  by  four  through  15  points,  called  Salmon  points. 
There  are  many  further  such  extensions  and  properties  of  the  configuration, 
and  they  are  all  descriptive  properties. 

In  this  set  of  problems  we  shall  consider  a  few  of  the  many  corollaries 
of  the  “mystic  hexagram”  theorem  which  can  be  obtained  by  making  some 
of  the  six  points  coincide  with  one  another.  For  simplicity  we  shall  number 
the  points  1,  2,  3,  4,  5,  6.  Then  Pascal’s  theorem  says  that  the  intersections 
of  the  pairs  of  lines  12,  45;  23,  56;  34,  61  are  collinear  if  and  only  if  the  six 
points  lie  on  a  conic. 

(a)  If  a  pentagon  12345  is  inscribed  in  a  conic,  show  that  the  pairs  of 
lines 12, 45; 23, 51; 34 and the tangent at 1, intersect in three collinear points.

(b)  Given  five  points,  draw  at  any  one  of  them  the  tangent  to  the  conic 
determined  by  the  five  points. 

(c)  Given  four  points  of  a  conic  and  the  tangent  at  any  one  of  them, 
construct  further  points  on  the  conic. 

(d)  Show  that  the  pairs  of  opposite  sides  of  a  quadrangle  inscribed  in 
a  conic,  together  with  the  pairs  of  tangents  at  opposite  vertices,  intersect  in 
four  collinear  points. 

(e)  Show  that  if  a  triangle  is  inscribed  in  a  conic,  then  the  tangents  at 
the  vertices  intersect  the  opposite  sides  in  three  collinear  points. 


(f) Given three points on a conic and the tangents at two of them,
construct  the  tangent  at  the  third. 

4.5.2  Prove  the  following  sequence  of  theorems: 

(a) If the vertex V of a triangle VAB is joined to any point P on the
line AB, then

$\frac{AP}{PB} = \frac{VA \sin AVP}{BV \sin PVB}.$

(b) If A, B, T, S are four points on a line m, and if V is a fifth point
not on m, then

$\frac{AT/BT}{AS/BS} = \frac{\sin AVT / \sin BVT}{\sin AVS / \sin BVS}.$

(c)  The  cross  ratio  of  four  collinear  points  is  invariant  under  projection. 

4.5.3 (a) Dualize 4.5.1 (a).

(b)  Given  five  lines,  find  on  any  one  of  them  the  point  of  contact  of 
the  conic  touching  the  five  lines. 

(c)  Given  four  tangents  to  a  conic  and  the  point  of  contact  of  any  one 
of  them,  construct  further  tangents  to  the  conic. 

(d)  Dualize  4.5.1  (d). 

(e)  Dualize  4.5.1  (e). 

(f) Given three tangents to a conic and the points of contact of two
of  them,  construct  the  point  of  contact  of  the  third. 

(g)  Dualize  the  following  theorem,  known  as  Desargues’s  two-triangle 
theorem :  “If  two  triangles  (in  the  same  plane)  are  so  situated  that  lines  joining 
pairs  of  corresponding  vertices  are  concurrent,  then  the  points  of  intersection 
of  pairs  of  corresponding  sides  are  collinear,  and  conversely.”  Actually,  the 
theorem  holds  whether  the  two  triangles  are  in  the  same  plane  or  not. 

4.5.4 (a) If m is a given line in a given plane π, and O is a given center
of projection (not on π), show how to find a plane π' such that the projection
of m onto π' will be the line at infinity on π'. (The operation of selecting a
suitable center of projection O and plane of projection π' so that a given line
on a given plane shall project into the line at infinity on π' is called
“projecting the given line to infinity.” Many descriptive theorems concerning a
given figure may be proved more readily in a specialized figure obtained
by projecting a particular line of the given figure to infinity. This device
is to be used in the following parts.)

(b) Let UP, UQ, UR be three concurrent coplanar lines, cut by two
lines OX and OT in $P_1, Q_1, R_1$ and $P_2, Q_2, R_2$, respectively. Prove that the
intersections of $Q_1R_2$ and $Q_2R_1$, $R_1P_2$ and $R_2P_1$, $P_1Q_2$ and $P_2Q_1$ are collinear.

(c) Prove that if $A_1B_1C_1$ and $A_2B_2C_2$ are two coplanar triangles such
that $B_1C_1$ and $B_2C_2$ meet in L, $C_1A_1$ and $C_2A_2$ meet in M, $A_1B_1$ and $A_2B_2$
meet in N, where L, M, N are collinear, then $A_1A_2$, $B_1B_2$, $C_1C_2$ are concurrent.
[This is the converse part of the statement of Desargues’s two-triangle theorem
as given in Problem 4.5.3 (g).]

(d) Show that by parallel projection (a projection where the center of
projection  is  at  infinity)  an  ellipse  may  always  be  projected  into  a  circle. 

(e) In 1678 the Italian Giovanni Ceva (ca. 1647-1736) published a
work containing the following theorem now known by his name: The three
lines which join three points L, M, N on the sides BC, CA, AB of a triangle ABC
to the opposite vertices are concurrent if and only if

$\left(\frac{AN}{NB}\right)\left(\frac{BL}{LC}\right)\left(\frac{CM}{MA}\right) = +1.$

Using  Ceva’s  theorem,  prove  that  the  lines  joining  the  vertices  of  a  triangle 
to  the  opposite  points  of  contact  of  the  inscribed  circle  are  concurrent.  Then, 
by  means  of  part  (d),  prove  that  the  lines  joining  the  vertices  of  a  triangle 
to  the  opposite  points  of  contact  of  an  inscribed  ellipse  are  concurrent. 

4.5.5  Using  the  pole  and  polar  theory  as  sketched  in  Section  4.5,  show 
how  to  draw,  with  straightedge  alone,  the  tangents  to  a  given  ellipse  from 
a  given  external  point. 

4.5.6  Consider  the  following  set  of  postulates: 

P1: There exist a point and a line such that the point is not on the line.
P2:  Every  line  contains  at  least  three  distinct  points. 

P3:  Any  two  distinct  points  are  on  just  one  line. 

P4:  Any  two  distinct  lines  contain  at  least  one  common  point. 

(a)  Show  that  the  postulate  set  implies  the  dual  of  each  postulate  of  the 
set. 

(b)  Show  that  the  postulate  set  is  equivalent  to  the  postulate  set  for 
plane  projective  geometry  given  in  Section  4.5. 

(c)  Since  there  are  plane  geometries  satisfying  the  above  postulates  but 
in which Desargues’ theorem does not hold (such geometries are called
non-Desarguesian), the following postulate is often added.

P5: If two triangles have the joins of corresponding vertices concurrent,
then  the  intersections  of  corresponding  sides  are  collinear. 

Deduce  the  dual  of  this  postulate. 

4.5.7  If,  in  a  trigonometric  equation,  each  trigonometric  function  that 
appears  is  replaced  by  its  cofunction,  the  new  equation  obtained  is  called 
the  dual  of  the  original  equation.  Establish  the  following  principle  of  duality 
of  trigonometry:  If  a  trigonometric  equation  involving  a  single  angle  is  an 
identity,  then  its  dual  is  also  an  identity. 

4.5.8  Look  up,  in  a  text  on  solid  geometry,  the  definition  of  polar  triangles 
on  a  sphere.  Can  this  concept  be  used  to  establish  a  principle  of  duality  in 
the  geometry  of  spherical  triangles  on  a  given  sphere? 

4.5.9  An  interesting  development  in  coordinate  systems  was  inaugurated 
by Julius Plücker (1801-1868) in 1829, when he noted that our fundamental
element  need  not  be  the  point,  but  can  be  any  geometric  entity.  Thus  if 
we  choose  the  straight  line  as  our  fundamental  element,  we  might  locate 
any  straight  line  not  passing  through  the  origin  of  a  given  rectangular 
Cartesian  frame  of  reference  by  recording,  say,  the  x  and  y  intercepts  of  the 
given line. Plücker actually chose the negative reciprocals of these intercepts
as  the  location  numbers  of  the  line  and  considerably  exploited  the  analytic 
geometry  of  these  so-called  line  coordinates.  A  point  now,  instead  of  having 
coordinates,  possesses  a  linear  equation,  namely  the  equation  satisfied  by 
the  coordinates  of  all  the  lines  passing  through  the  point.  The  double 
interpretation  of  a  pair  of  coordinates  as  either  point  coordinates  or  line 
coordinates  and  of  a  linear  equation  as  either  the  equation  of  a  line  or  the 
equation of a point furnishes the basis of Plücker’s analytical proof of the
principle  of  duality  of  plane  projective  geometry. 

(a) Find the Plücker coordinates of the lines whose Cartesian equations
are 5x + 3y − 6 = 0 and ax + by + 1 = 0. Write the Cartesian equation
of the line having Plücker coordinates (1, 3).

(b) Show that the Plücker coordinates u, v of all lines passing through
the point with Cartesian coordinates (2, 3) satisfy the linear equation
2u + 3v + 1 = 0. This equation is taken as the Plücker equation of the
point (2, 3). What are the Cartesian coordinates of the points whose Plücker
equations are 5u + 3v − 6 = 0 and au + bv + 1 = 0? Write the Plücker
equation of the point having Cartesian coordinates (1, 3).
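
The line coordinates described in Problem 4.5.9 can be computed mechanically. The sketch below is only an illustration under stated assumptions: the function names `plucker_coordinates` and `passes_through` are hypothetical, the sample line 2x − y + 4 = 0 is not taken from the text, and exact rational arithmetic is used to avoid rounding. A line ax + by + c = 0 with c ≠ 0 has intercepts −c/a and −c/b, whose negative reciprocals are a/c and b/c; when a or b is 0 the corresponding intercept does not exist, and the sketch simply returns 0 for that coordinate.

```python
from fractions import Fraction


def plucker_coordinates(a, b, c):
    """Line coordinates (u, v) of the line a*x + b*y + c = 0 with c != 0.

    The x-intercept is -c/a and the y-intercept is -c/b, so their negative
    reciprocals are a/c and b/c.
    """
    if c == 0:
        raise ValueError("a line through the origin has no coordinates of this kind")
    return (Fraction(a, c), Fraction(b, c))


def passes_through(line, point):
    """True if the line with coordinates (u, v) contains the point (x, y)."""
    u, v = line
    x, y = point
    return u * x + v * y + 1 == 0


line = plucker_coordinates(2, -1, 4)   # the line 2x - y + 4 = 0
print(line)                            # (Fraction(1, 2), Fraction(-1, 4))
print(passes_through(line, (-2, 0)))   # True: (-2, 0) is the x-intercept
print(passes_through(line, (0, 4)))    # True: (0, 4) is the y-intercept
```

Holding the point fixed and letting (u, v) vary in `passes_through` gives the linear condition ux + vy + 1 = 0, which is the sense in which a point "possesses a linear equation" in line coordinates.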


CHAPTER FIVE

ALGEBRAIC  STRUCTURE 


5.1  Emergence  of  Algebraic  Structure 

As  remarked  at  the  beginning  of  the  last  chapter,  there  was  another 
factor,  besides  the  discovery  of  non-Euclidean  geometry,  which  greatly 
influenced  the  development  of  the  axiomatic  method  and  hence  the  character 
of  much  of  modern  mathematical  research.  This  other  factor  was  the 
recognition, first by British mathematicians in the earlier half of the
nineteenth century, of the existence of structure in algebra.

Let  us  clarify  what  we  mean  by  the  phrase,  algebraic  structure.  In  studying 
the ordinary arithmetic of the positive integers, one encounters two
operations called “addition” and “multiplication.” The notion of an operation,
which  is  fundamental  in  mathematics,  stems  from  the  more  general  idea  of 
function.  A  formal  definition  may  be  phrased  as  follows: 

An operation on a set S of elements is a rule which assigns to each ordered
subset of n elements of S a uniquely defined element of the same set S;
according to whether n = 1, 2, 3, ..., n, ..., the operation is said to be
unary, binary, ternary, ..., n-ary, ....

We see, then, three criteria for a rule to be called an n-ary operation
on a set S. First, the rule must define a result for every ordered subset of n
elements of S. Thus division is not a binary operation on the set of all real
numbers, since a quotient with denominator 0, for example, is not defined.
Second, the result must always be unique. Finding a solution, then, to the
equation $x^2 = a^2$ is not a unary operation on the set of real numbers a,
since we can have either x = a or x = −a. Third, the result must always be
an element of the given set S. Thus finding $\sqrt{a}$ is not a unary operation on
the set of all real numbers a, since $\sqrt{a}$ might not be real, as in the case
of $\sqrt{-1}$.
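
On a finite set the first and third criteria can be checked by exhaustion. The following Python sketch is offered only as an illustration; the name `is_nary_operation` and the test set {0, 1, 2, 3, 4} are assumptions introduced here, not part of the text. The second criterion (uniqueness) is built in, since a Python function returns a single value for given arguments.

```python
from fractions import Fraction
from itertools import product


def is_nary_operation(rule, elements, n):
    """True if `rule` gives a result in `elements` for every ordered n-tuple."""
    for args in product(elements, repeat=n):
        try:
            result = rule(*args)
        except (ZeroDivisionError, ValueError):
            return False  # first criterion fails: no result is defined
        if result not in elements:
            return False  # third criterion fails: the result leaves the set
    return True


S = set(range(5))  # the finite set {0, 1, 2, 3, 4}


def add_mod_5(a, b):
    return (a + b) % 5


def ordinary_add(a, b):
    return a + b


def divide(a, b):
    return Fraction(a, b)  # raises ZeroDivisionError when b == 0


print(is_nary_operation(add_mod_5, S, 2))     # True
print(is_nary_operation(ordinary_add, S, 2))  # False: 4 + 4 = 8 is not in S
print(is_nary_operation(divide, S, 2))        # False: division by 0 is undefined
```

The infinite sets discussed in the text cannot be checked this way; the sketch only makes the three criteria concrete.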

An  example  of  a  unary  operation  performed  on  the  set  of  positive 
integers is that of squaring, whereby to each positive integer p is assigned
the positive integer $p^2$. As other examples of unary operations performed on
the set of positive integers we might mention that of taking the successor
and that of taking the factorial, whereby to each positive integer p are
assigned, respectively, the positive integers p + 1 and (1)(2)(3)···(p).

Ordinary  addition  and  multiplication  performed  on  the  set  of  positive 
integers  are  binary  operations;  to  each  ordered  pair  of  positive  integers 
a  and  b  are  assigned  unique  positive  integers  c  and  d,  called,  respectively, 
the  sum  of  a  and  b  and  the  product  of  a  and  b,  and  denoted  by  the  symbols 

c = a + b,  d = a X b.

These  two  binary  operations  of  addition  and  multiplication  performed  on 
the  set  of  positive  integers  possess  certain  basic  properties.  For  example,  if 
a, b, c denote arbitrary positive integers, we have

1. a + b = b + a, the so-called commutative law for addition.

2. a X b = b X a, the commutative law for multiplication.

3.  (a  +  b)  +  c  =  a  +  (b  +  c),  the  associative  law  for  addition. 

4.  (a  X  b)  X  c  =  a  X  (b  X  c),  the  associative  law  for  multiplication. 

5. a X (b + c) = (a X b) + (a X c), the distributive law for multiplication
over  addition. 

In the early nineteenth century, algebra was considered simply symbolized
arithmetic.* In other words, instead of working with specific numbers,
as  in  arithmetic,  in  algebra  letters  are  used  that  represent  these  numbers. 
The  above  five  properties,  then,  are  statements  which  always  hold  in  the 
algebra  of  positive  integers.  But,  since  the  statements  are  symbolic,  it  is 
conceivable  that  they  might  be  applicable  to  some  set  of  elements  other  than 
the  positive  integers,  provided  we  supply  appropriate  definitions  for  the  two 
binary  operations  involved.  This  is  indeed  the  case,  as  the  following  examples 
amply  illustrate.  If  in  each  case  we  denote  the  set  of  elements  by  S,  it  is  an 
easy  matter  to  verify  that  the  elements,  under  the  two  given  binary  operations 
of + and X, satisfy all five of the above basic properties. In each example,
equality  is  employed  in  the  sense  of  identity. 

*This  is  still  the  view  of  algebra  as  taught  in  many  high  schools  and  frequently  in 
the  freshman  year  at  college. 

EXAMPLES

(a)  Let  S  be  the  set  of  all  even  positive  integers,  and  let  +  and  X  denote  the  usual 
addition  and  multiplication. 

(b)  Let  S  be  the  set  of  all  rational  numbers  (integers  and  fractions,  positive,  negative, 
and  zero),  and  let  +  and  X  denote  the  usual  addition  and  multiplication. 

(c)  Let  S  be  the  set  of  all  real  numbers,  and  let  +  and  X  denote  the  usual  addition 
and  multiplication. 

(d) Let S be the set of all real numbers of the form $m + n\sqrt{2}$, where m and n are
integers, and let + and X denote the usual addition and multiplication.

(e) Let S be the set of Gaussian integers (complex numbers m + in, where m and n
are ordinary integers and $i = \sqrt{-1}$), and let + and X denote the usual
addition and multiplication of complex numbers.

(f) Let S be the set of all ordered pairs (m, n) of integers, and let (a, b) + (c, d)
= (a + c, b + d) and (a, b) X (c, d) = (ac, bd).

(g) Let S be the set of all ordered pairs (m, n) of integers, and let (a, b) + (c, d)
= (a + c, b + d) and (a, b) X (c, d) = (ac − bd, ad + bc).

(h) Let S be the set of all real polynomials in the real variable x, and let + and X
denote the ordinary addition and multiplication of polynomials.

(i) Let S be the set of all real-valued continuous functions of the variable x defined
on the closed interval 0 ≤ x ≤ 1, and let + and X denote ordinary addition
and multiplication.

(j) Let S be the set consisting of just two distinct elements m and n, where we define

m + m = m,  m X m = m,
m + n = n + m = n,  m X n = n X m = m,
n + n = m,  n X n = n.

(k) Let S be the set of all sets of points in a plane. If a and b are two members of S,
define a + b to be the set of points belonging to either set a or set b (or to both
a and b), and define a X b to be the set of points belonging to both set a and
set b. As a special member of S we introduce an ideal set, the null set, which
has no points in it.
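
The verification left to the reader can be carried out by machine for the finite Example (j), and spot-checked for Example (g). The Python sketch below is a minimal illustration; the function name `satisfies_five_laws`, the lookup tables, and the choice of sample pairs are assumptions made here. For Example (g) only a small sample of integer pairs is tested, which supports the claim without proving it.

```python
from itertools import product


def satisfies_five_laws(elements, add, mul):
    """Check the five basic properties for every triple drawn from `elements`."""
    for a, b, c in product(elements, repeat=3):
        if add(a, b) != add(b, a):                       # property 1
            return False
        if mul(a, b) != mul(b, a):                       # property 2
            return False
        if add(add(a, b), c) != add(a, add(b, c)):       # property 3
            return False
        if mul(mul(a, b), c) != mul(a, mul(b, c)):       # property 4
            return False
        if mul(a, add(b, c)) != add(mul(a, b), mul(a, c)):  # property 5
            return False
    return True


# Example (j): the two-element system, stored as lookup tables.
ADD_J = {("m", "m"): "m", ("m", "n"): "n", ("n", "m"): "n", ("n", "n"): "m"}
MUL_J = {("m", "m"): "m", ("m", "n"): "m", ("n", "m"): "m", ("n", "n"): "n"}
example_j = satisfies_five_laws(
    ["m", "n"], lambda a, b: ADD_J[a, b], lambda a, b: MUL_J[a, b]
)


# Example (g): ordered pairs of integers, tested on a small sample only.
def add_g(p, q):
    return (p[0] + q[0], p[1] + q[1])


def mul_g(p, q):
    return (p[0] * q[0] - p[1] * q[1], p[0] * q[1] + p[1] * q[0])


sample = list(product(range(-2, 3), repeat=2))
example_g = satisfies_five_laws(sample, add_g, mul_g)

print(example_j, example_g)  # True True
```

Replacing `add` by subtraction makes the check fail at once, since a − b and b − a generally differ; this is a quick way to see that the five properties are genuine restrictions.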

In  view  of  these  examples,  which  can  easily  be  extended  in  number, 
it  is  apparent  that  the  five  basic  properties  of  positive  integers  listed  above 
may  also  be  regarded  as  properties  of  many  other  entirely  different  systems 
of  elements.  The  consequences  of  the  above  five  properties  constitute  an 
algebra  applicable  to  the  positive  integers,  but  it  is  also  evident  that  the 
consequences  of  the  five  properties  also  constitute  an  algebra  applicable  to 
many  other  systems.  That  is  to  say,  there  is  a  common  algebraic  structure 
(the  five  basic  properties  and  their  consequences)  attached  to  many  different 
systems.  The  five  basic  properties  may  be  regarded  as  postulates  for  a 
particular  type  of  algebraic  structure,  and  any  theorem  formally  implied  by 
these  postulates  would  be  applicable  to  each  of  the  examples  given  above,  or 
to  any  other  interpretation  satisfying  the  five  basic  properties.  Considered 
from this view, then, algebra is severed from its tie to arithmetic, and an
algebra  becomes  a  purely  formal  hypothetico-deductive  study. 

The  earliest  glimmerings  of  the  above  modern  view  of  algebra  appeared 
about  1830  in  England,  with  the  work  of  George  Peacock  (1791-1858). 
Peacock  was  one  of  the  first  to  study  seriously  the  fundamental  principles 
of  algebra.*  He  made  a  distinction  between  what  he  called  “arithmetical 
algebra”  and  “symbolical  algebra.”  The  former  was  regarded  by  Peacock 
as  the  study  which  results  from  the  use  of  symbols  to  denote  ordinary 
positive  decimal  numbers,  together  with  signs  for  the  operations,  like  addition 
and subtraction, to which these numbers may be subjected. Now, in
“arithmetical algebra,” certain operations are limited in their applicability. For
example, in a subtraction, a − b, we must have a greater than b. Peacock’s
“symbolical algebra,” on the other hand, adopts the operations of
“arithmetical algebra” but ignores their restrictions. Thus subtraction in
“symbolical algebra” differs from the same operation in “arithmetical algebra”
in  that  it  is  to  be  regarded  as  always  applicable.  The  justification  of  this 
extension  of  the  rules  of  “arithmetical  algebra”  to  “symbolical  algebra” 
was  called,  by  Peacock,  the  principle  of  the  permanence  of  equivalent  forms. 
Peacock’s  “symbolical  algebra”  is  a  universal  “arithmetical  algebra”  whose 
operations  are  determined  by  those  of  “arithmetical  algebra,”  so  far  as  the 
two  algebras  proceed  in  common,  and  by  the  principle  of  the  permanence  of 
equivalent  forms  in  all  other  cases.  The  principle  of  permanence  of  equivalent 
forms  was  regarded  as  a  powerful  concept  in  mathematics,  and  it  played  a 
historical  role  in  such  matters  as  the  early  development  of  the  arithmetic  of 
the  complex  number  system  and  the  extension  of  the  laws  of  exponents  from 
positive  integral  exponents  to  exponents  of  a  more  general  kind.  In  the 
theory of exponents, for example, if a is a positive rational number and n
is a positive integer, then $a^n$ is, by definition, the product of n a’s. From this
definition it readily follows that, for any two positive integers m and n,
$a^m a^n = a^{m+n}$. By the principle of permanence of equivalent forms, Peacock
affirmed that in “symbolical algebra,” $a^m a^n = a^{m+n}$, no matter what might
be the nature of the base a or of the exponents m and n. The hazy principle
of  permanence  of  equivalent  forms  has  today  been  scrapped,  but  we  are 
still  often  guided,  when  attempting  to  extend  a  definition,  to  formulate  the 
more  general  definition  in  such  a  way  that  most  properties  of  the  old 
definition  will  be  preserved. 

British  contemporaries  of  Peacock  advanced  his  studies  and  pushed 
the  notion  of  algebra  closer  to  the  modern  concept  of  the  subject.  Thus 
Duncan Farquharson Gregory (1813-1844) published a paper in 1840†
in  which  the  commutative  and  distributive  laws  in  algebra  were  clearly 

*G.  Peacock  [1]  and  [2]  (Bibliography). 

†D. F. Gregory.
brought out. Further advances in an understanding of the foundations of
algebra  were  made  by  Augustus  De  Morgan  (1806-1871),*  another  member 
of  the  British  school  of  algebraists.  In  the  somewhat  groping  work  of  the 
British  school  one  can  trace  the  emergence  of  the  idea  of  algebraic  structure 
and  the  preparation  for  the  postulational  program  in  the  development  of 
algebra.  Soon  the  ideas  of  the  British  school  spread  to  continental  Europe 
where, in 1867, they were considered with great thoroughness by the German
historian of mathematics, Hermann Hankel (1839-1873).† But, even before
Hankel’s treatment appeared, the Irish mathematician William Rowan
Hamilton (1805-1865) and the German mathematician Hermann Günther
Grassmann  (1809-1877)  had  published  results  which  were  of  a  far-reaching 
character,  results  which  led  to  the  liberation  of  algebra  in  much  the  same 
way  that  the  discoveries  of  Lobachevsky  and  Bolyai  led  to  the  liberation  of 
geometry,  and  which  opened  the  flood  gates  of  modern  abstract  algebra. 
This  remarkable  work  of  Hamilton  and  Grassmann  will  be  considered  in 
the  next  section. 

Before  passing  to  the  liberation  of  algebra  by  Hamilton  and  Grassmann, 
it  might  be  well  to  consider  a  modern  set  of  postulates  for  common  algebra. 
This  structure  is  today  technically  called  an  ordered  field. 

A field is a set S of elements, along with two binary operations on S,
here denoted by ⊕ and ⊗ so that we will not necessarily think of them as
ordinary addition and multiplication, satisfying the following postulates.‡
For  the  present,  equality  is  used  in  the  sense  of  identity;  thus  a  =  b  means 
a  and  b  are  the  same  element. 


P1: If a and b are in S, then a ⊕ b = b ⊕ a.

P2: If a and b are in S, then a ⊗ b = b ⊗ a.

P3: If a, b, c are in S, then (a ⊕ b) ⊕ c = a ⊕ (b ⊕ c).

P4: If a, b, c are in S, then (a ⊗ b) ⊗ c = a ⊗ (b ⊗ c).

P5: If a, b, c are in S, then a ⊗ (b ⊕ c) = (a ⊗ b) ⊕ (a ⊗ c) and
(b ⊕ c) ⊗ a = (b ⊗ a) ⊕ (c ⊗ a).

P6: S contains an element z (zero) such that for any element a of S, a ⊕ z = a.

P7: S contains an element u (unity), different from z, such that for any element
a of S, a ⊗ u = a.

P8: For each element a in S there exists an element ā in S such that a ⊕ ā = z.

P9: If a, b, c are in S, c ≠ z, and c ⊗ a = c ⊗ b or a ⊗ c = b ⊗ c,
then a = b. (These are the cancellation laws for the operation ⊗.)

P10: For each element a ≠ z in S, there exists an element $a^{-1}$ in S such that
a ⊗ $a^{-1}$ = u.

*A. De Morgan.

†H. Hankel.

‡It is customary to place the symbol designating a binary operation between the
two elements upon which it operates.

If, in addition to the above ten postulates, the following two postulates
hold, the field is called an ordered field.

P11: There exists a subset P, not containing z, of the set S such that if a ≠ z
then one and only one of a and ā is in P.

P12: If a and b are in P, then a ⊕ b and a ⊗ b are in P.

Definition 1: The elements of P are known as the positive elements of
S; all other nonzero elements of S are known as the negative elements of S.

Definition 2: If a and b are elements of S and if a ⊕ b̄ is positive, then
we write a > b and b < a.

The  postulate  set  for  a  field  has  been  made  somewhat  redundant  in 
order  that  it  may  serve  a  future  purpose.  For  example,  in  view  of  Postulate 
P2,  only  one  of  the  distributive  laws  in  Postulate  P5  needed  to  be  given. 
Also it can be shown (see Appendix, Section A.2) that the entire Postulate
P1 and the entire Postulate P9 are redundant. Note that our original five
basic properties of the positive integers appear as Postulates P1, P2, P3, P4,
P5,  above. 
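
For a finite set, Postulates P1 through P10 can be checked by exhaustion. The Python sketch below is an illustration only: the function names `is_field` and `integers_mod`, and the use of the integers modulo n as test systems, are assumptions introduced here and are not examples taken from the text. In the comments, (+) and (x) stand for the two field operations.

```python
from itertools import product


def is_field(elements, add, mul):
    """Brute-force check of Postulates P1-P10 on a finite set of elements."""
    S = list(elements)
    members = set(S)
    # Both rules must be binary operations on S (results stay in S).
    for a, b in product(S, repeat=2):
        if add(a, b) not in members or mul(a, b) not in members:
            return False
    # P1-P5: commutative, associative, and distributive laws.
    for a, b, c in product(S, repeat=3):
        if add(a, b) != add(b, a) or mul(a, b) != mul(b, a):
            return False
        if add(add(a, b), c) != add(a, add(b, c)):
            return False
        if mul(mul(a, b), c) != mul(a, mul(b, c)):
            return False
        if mul(a, add(b, c)) != add(mul(a, b), mul(a, c)):
            return False
        if mul(add(b, c), a) != add(mul(b, a), mul(c, a)):
            return False
    # P6: a zero z with a (+) z = a for every a.
    zeros = [z for z in S if all(add(a, z) == a for a in S)]
    if not zeros:
        return False
    z = zeros[0]
    # P7: a unity u, different from z, with a (x) u = a for every a.
    units = [u for u in S if u != z and all(mul(a, u) == a for a in S)]
    if not units:
        return False
    u = units[0]
    # P8: every element has an additive inverse.
    if not all(any(add(a, x) == z for x in S) for a in S):
        return False
    # P9: cancellation laws for (x).
    for a, b, c in product(S, repeat=3):
        if c != z and a != b and (mul(c, a) == mul(c, b) or mul(a, c) == mul(b, c)):
            return False
    # P10: every element other than z has a multiplicative inverse.
    return all(any(mul(a, x) == u for x in S) for a in S if a != z)


def integers_mod(n):
    """The set {0, ..., n-1} with addition and multiplication modulo n."""
    return range(n), (lambda a, b: (a + b) % n), (lambda a, b: (a * b) % n)


print(is_field(*integers_mod(5)))  # True
print(is_field(*integers_mod(6)))  # False: 2 x 3 = 2 x 0, so cancellation (P9) fails
```

The check of P9 and P10 separately mirrors the deliberate redundancy of the postulate set; for the integers modulo 6 both fail, while P1 through P8 hold.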

The  twelve  postulates  for  an  ordered  field  constitute  a  postulate  set  for 
elementary algebra. The development of elementary algebra from these
postulates would be an edifying, though tedious, undertaking. Such a
development would be too abstract to employ at the high school level, where
elementary  algebra  is  first  taught,  but  its  understanding  is  essential  to  a 
comprehension  of  trends  in  modern  mathematics. 

It  is  an  easy  matter  for  the  reader  to  show  that  any  set  of  complex 
numbers, in which the sum, difference, product, and quotient (with
denominator different from 0) of any two numbers in the set are again in the set,
constitutes  a  field  under  the  binary  operations  +  and  X.  Such  a  field  is 
known  as  a  number  field.  This  explains  why  ordinary  addition,  subtraction, 
multiplication,  and  division,  applied  to  numbers,  are  sometimes  referred  to 
as  the  four  field  operations.  In  the  Appendix,  Section  A. 3,  we  employ  some 
elementary  theory  of  number  fields  to  prove  the  impossibility  of  trisecting 
an  arbitrary  angle,  of  duplicating  a  cube,  and  of  squaring  a  circle  with  only 
the  Euclidean  straightedge  and  compasses. 

Two  very  important  examples  of  ordered  fields  are  the  set  of  all  rational 
numbers  and  the  set  of  all  real  numbers,  together  with  the  familiar  operations 
of  addition  and  multiplication  performed  on  these  numbers.  We  leave  it 
to  the  reader  to  verify  this  by  checking  the  twelve  postulates  listed  above. 

5.2  The  Liberation  of  Algebra 

Geometry, we have seen, remained shackled to Euclid’s version of the
subject until Lobachevsky and Bolyai, in 1829 and 1832, liberated it from its
bonds by creating an equally consistent geometry in which one of Euclid’s
postulates fails to hold. With this accomplishment, a deep-rooted and
centuries-old conviction that there could be only the one possible geometry
was shattered, and the way was opened for the creation of many different
systems of geometry. The same sort of story can be told of algebra. It seemed
inconceivable, in the early nineteenth century, that there could exist an
algebra different from the common algebra of arithmetic. To attempt, for
example, the construction of a consistent algebra in which the commutative
law of multiplication fails to hold probably did not occur to anyone of the
time; had it occurred, it would surely have been dropped as a purely
ridiculous idea. After all, how could one possibly have a logical algebra in
which a × b is not equal to b × a? Such was the feeling about algebra when,
in 1843, William Rowan Hamilton was forced, in efforts to work out an algebra
suitable to the physicist for space analysis, to invent an algebra in which
the commutative law of multiplication does not hold.* The radical step of
abandoning the commutative law did not come easily to Hamilton; it dawned on
him only after years of cogitation on a particular problem.

It would take us too far afield to go into the physical motivation that lay
behind Hamilton’s creation. Perhaps the best approach, for our purposes, is
through Hamilton’s elegant treatment of complex numbers as real number
pairs.† To the mathematicians of his time, a complex number was considered
a number of the form a + bi, where a and b are real numbers and i² = −1.
Addition and multiplication of complex numbers were accomplished by
treating a + bi as a linear polynomial in i, and by replacing i², wherever
it might occur, by −1. One finds in this way, for addition,

(a + bi) + (c + di) = (a + c) + (b + d)i,

and, for multiplication,

(a + bi)(c + di) = ac + adi + bci + bdi² = (ac − bd) + (ad + bc)i.

If these results should be taken as definitions for the addition and
multiplication of pairs of complex numbers, it is not difficult to show that
addition and multiplication are commutative and associative, and that
multiplication is distributive over addition.

Now, since a complex number a + bi is completely determined by the
two real numbers a and b, it occurred to Hamilton to represent the complex
number by the ordered real number pair (a, b). He defined two such number
pairs (a, b) and (c, d) to be equal if and only if a = c and b = d. Addition
and multiplication of such number pairs he defined (to agree with the above
results) to be (a, b) + (c, d) = (a + c, b + d) and (a, b)(c, d) = (ac − bd,
ad + bc). With these definitions it is easy to show that addition and
multiplication of the ordered real number pairs are commutative and
associative, and that multiplication is distributive over addition, if one
assumes, of course, that these laws hold for the ordinary addition and
multiplication of real numbers. In fact, assuming that the real numbers, under
ordinary addition and multiplication, satisfy all the postulates of a field
(as given in the last section), one can show that Hamilton’s number pairs,
under his definitions of addition and multiplication, also satisfy all the
postulates of a field.

*Hamilton later considerably expanded these ideas. See W. R. Hamilton [1] and [2].

†Communicated by Hamilton in 1833 and 1835 to the Royal Irish Academy, and
published in 1837.

It  is  to  be  noted  that  the  real  number  system  is  embedded  in  the  complex 
number  system.  By  this  statement  is  meant  that  if  each  real  number  r  is 
identified  with  the  corresponding  number  pair  (r,  0),  then  the  correspondence 
is  preserved  under  addition  and  multiplication  of  complex  numbers,  for 
we  have 

(a, 0)  +  (b, 0)  =  (a  +  b,  0)  and  (a, 0)  (b, 0)  =  (ab, 0). 

In  practice,  a  complex  number  of  the  form  (r,  0)  can  be  replaced  by  its 
corresponding  real  number  r. 

To  obtain  the  older  form  of  a  complex  number  from  Hamilton’s  form, 
we note that any complex number (a, b) can be written as

(a, b) = (a, 0) + (0, b) = (a, 0) + (b, 0)(0, 1) = a + bi,

where (0, 1) is represented by the symbol i, and (a, 0) and (b, 0) are identified
with the real numbers a and b. Finally, we see that

i² = (0, 1)(0, 1) = (−1, 0) = −1.
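As an informal check of the pair arithmetic just described, the following minimal Python sketch implements Hamilton's two definitions and tests the stated laws on a small grid of integer pairs. The names `pair_add`, `pair_mul`, `laws_hold`, and `UNIT_I` are illustrative choices, not notation from the text, and a finite spot-check of this kind is no substitute for the general proof.

```python
from itertools import product


def pair_add(p, q):
    """(a, b) + (c, d) = (a + c, b + d)."""
    (a, b), (c, d) = p, q
    return (a + c, b + d)


def pair_mul(p, q):
    """(a, b)(c, d) = (ac - bd, ad + bc)."""
    (a, b), (c, d) = p, q
    return (a * c - b * d, a * d + b * c)


def laws_hold(pairs):
    """Spot-check the commutative, associative, and distributive laws."""
    for p, q, r in product(pairs, repeat=3):
        if pair_add(p, q) != pair_add(q, p):
            return False
        if pair_mul(p, q) != pair_mul(q, p):
            return False
        if pair_add(pair_add(p, q), r) != pair_add(p, pair_add(q, r)):
            return False
        if pair_mul(pair_mul(p, q), r) != pair_mul(p, pair_mul(q, r)):
            return False
        if pair_mul(p, pair_add(q, r)) != pair_add(pair_mul(p, q), pair_mul(p, r)):
            return False
    return True


# A small grid of integer pairs (an arbitrary sample, chosen for illustration).
SAMPLE = list(product(range(-2, 3), repeat=2))
UNIT_I = (0, 1)  # the pair that plays the role of i

laws_ok = laws_hold(SAMPLE)
i_squared = pair_mul(UNIT_I, UNIT_I)    # the pair identified with -1
embedded_sum = pair_add((3, 0), (4, 0))  # reals embedded as (r, 0)
embedded_product = pair_mul((3, 0), (4, 0))
```

Integer pairs are used only so that equality tests are exact; the same functions work unchanged for any numeric type.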

The complex number system is a very convenient number system for
the study of vectors and rotations in the plane.* Hamilton was attempting
to devise an analogous system of numbers for the study of vectors and
rotations in three-dimensional space. In his researches he was led to the
consideration, not of ordered real number pairs (a, b) having the real numbers
embedded within them, but of ordered real number quadruples (a, b, c, d)
having both the real and the complex numbers embedded within them.
In other words, defining two such quadruples (a, b, c, d) and (e, f, g, h)
to be equal if and only if a = e, b = f, c = g, d = h, Hamilton found it
necessary to define an addition and multiplication of ordered real number
quadruples in such a way that, in particular, he would have

(a, 0, 0, 0) + (b, 0, 0, 0) = (a + b, 0, 0, 0),

(a, 0, 0, 0)(b, 0, 0, 0) = (ab, 0, 0, 0),

(a, b, 0, 0) + (c, d, 0, 0) = (a + c, b + d, 0, 0),

(a, b, 0, 0)(c, d, 0, 0) = (ac − bd, ad + bc, 0, 0).

*This convenience results from the fact that when a complex number z = a + bi is
considered as representing the point Z having rectangular Cartesian coordinates
(a, b), then the complex number z may also be regarded as representing the vector
OZ, where O is the origin of coordinates.
Calling such ordered real number quadruples (real) quaternions, Hamilton
found he had to formulate the following definitions for addition and
multiplication of his quaternions:

(a, b, c, d) + (e, f, g, h) = (a + e, b + f, c + g, d + h),

(a, b, c, d)(e, f, g, h) = (ae − bf − cg − dh, af + be + ch − dg,
                            ag + ce + df − bh, ah + bg + de − cf).

It  can  be  shown,  with  these  definitions,  that  the  real  numbers  and  the 
complex  numbers  are  embedded  among  the  quaternions,  and  that  if  we 
identify  the  quaternion  (m, 0,0,0)  with  the  real  number  m,  then 

m(a, b, c, d) = (a, b, c, d)m = (ma, mb, mc, md).

It can also be shown that addition of quaternions is commutative and
associative, and that multiplication of quaternions is associative and
distributive over addition. But the commutative law for multiplication fails
to hold. To see this, consider, in particular, the two quaternions
(0, 1, 0, 0) and (0, 0, 1, 0). One finds that

(0, 1, 0, 0)(0, 0, 1, 0) = (0, 0, 0, 1),

while

(0, 0, 1, 0)(0, 1, 0, 0) = (0, 0, 0, −1) = −(0, 0, 0, 1);

that is, the commutative law for multiplication is broken. In fact, if we
represent by the symbols 1, i, j, k, respectively, the quaternionic units
(1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1), we can verify that the
following multiplication table prevails; that is, the desired product is found
in the box common to the row headed by the first factor and the column headed
by the second factor:


 ×  |   1    i    j    k
----+--------------------
 1  |   1    i    j    k
 i  |   i   −1    k   −j
 j  |   j   −k   −1    i
 k  |   k    j   −i   −1

Hamilton  has  told  the  story  that  the  idea  of  abandoning  the  commutative 
law  of  multiplication  came  to  him  in  a  flash,  after  fifteen  years  of  fruitless 
meditation,  while  standing  on  a  bridge  near  Dublin.  He  was  so  struck  by  the 
unorthodoxy of the idea that it is said he scratched the gist of the above
multiplication  table  into  the  stone  of  the  bridge. 

We can write the quaternion (a, b, c, d) in the form a + bi + cj + dk.
When two quaternions are written in this form they may be multiplied like
polynomials in i, j, k, and then the resulting product put into the same form
by means of the above multiplication table.
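The quaternion product defined earlier in this section can be checked mechanically. The sketch below is a minimal illustration, with quaternions stored as plain 4-tuples of integers and with hypothetical helper names (`qmul`, `qneg`, `table`); it rebuilds the multiplication table of the units 1, i, j, k from the general formula and spot-checks associativity on the eight signed units.

```python
from itertools import product


def qmul(p, q):
    """Quaternion product of 4-tuples, following the definition in the text."""
    a, b, c, d = p
    e, f, g, h = q
    return (
        a * e - b * f - c * g - d * h,
        a * f + b * e + c * h - d * g,
        a * g + c * e + d * f - b * h,
        a * h + b * g + d * e - c * f,
    )


def qneg(q):
    """Negative of a quaternion."""
    return tuple(-x for x in q)


UNITS = {
    "1": (1, 0, 0, 0),
    "i": (0, 1, 0, 0),
    "j": (0, 0, 1, 0),
    "k": (0, 0, 0, 1),
}

# Name every unit and its negative so that products can be reported by name.
NAMES = {}
for name, unit in UNITS.items():
    NAMES[unit] = name
    NAMES[qneg(unit)] = "-" + name

# table[(x, y)] is the name of the product x y (row x, column y).
table = {
    (x, y): NAMES[qmul(UNITS[x], UNITS[y])]
    for x, y in product(UNITS, repeat=2)
}

# Associativity, spot-checked on the eight signed units only.
signed_units = list(NAMES)
associative = all(
    qmul(qmul(p, q), r) == qmul(p, qmul(q, r))
    for p, q, r in product(signed_units, repeat=3)
)

# The complex numbers sit inside the quaternions as (a, b, 0, 0).
complex_like = qmul((1, 2, 0, 0), (3, 4, 0, 0))
```

Looking up `table[("i", "j")]` and `table[("j", "i")]` shows the failure of commutativity directly: the two entries differ only in sign.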

In the year 1844, Hermann Grassmann published the first edition of his
remarkable Ausdehnungslehre,* in which were developed classes of algebras
of much greater generality than Hamilton’s quaternion algebra. Instead of
considering just ordered sets of quadruples of real numbers, Grassmann
considered ordered sets of n real numbers. To each such set (x₁, x₂, ..., xₙ)
Grassmann associated a hypercomplex number of the form x₁e₁ + x₂e₂
+ ... + xₙeₙ, where e₁, e₂, ..., eₙ are the fundamental units of his algebra.
Two such hypercomplex numbers are added and multiplied like polynomials
in e₁, e₂, ..., eₙ. The addition of two such numbers yields, then, a number
of the same kind. To make the product of two such numbers a number of
the same kind requires the construction of a multiplication table for the
units e₁, ..., eₙ similar to Hamilton’s multiplication table for his units
1, i, j, k. Here one has considerable freedom, and different algebras can be
created by making different multiplication tables. The multiplication table
is governed by the desired application of the algebra and by the laws of
algebra one wishes to preserve.

We shall not go deeper here into either Hamilton’s or Grassmann’s
work. By developing algebras satisfying laws different from those obeyed by
common algebra, these men opened the way for the study of innumerable
algebraic structures. By weakening or deleting various postulates of common
algebra, or by replacing one or more of the postulates by others, which are
consistent with the remaining postulates, an enormous variety of systems
can be studied. For example,† the deletion of Postulate P2 from the postulates
for a field, as listed in the previous section, leads to an algebraic structure
technically known as a division ring, or sfield.‡ The deletion of Postulates
P2, P7, P10 leads to an integral domain; the deletion of Postulates P2, P9,
P10 to a ring with unity; the deletion of Postulates P7, P9, P10 to a
commutative ring; and the deletion of Postulates P2, P7, P9, P10 to a ring
(with no qualifying phrase or adjective). In the next section, we shall
consider a particularly basic and important algebraic structure, obtained in a
similar way, known as a group. It is probably correct to say that
mathematicians have studied well over 200 such algebraic structures. As the
American algebraists Garrett Birkhoff and Saunders MacLane have written,
“Modern algebra has exposed for the first time the full variety and richness
of possible mathematical systems.”*

*H. G. Grassmann.

†The reader should be warned that there are slight variations in these
definitions as given by different writers.

‡It can be shown that quaternions constitute a division ring.

Before closing this section, let us consider one more noncommutative
algebra, the matric algebra devised by the English mathematician Arthur
Cayley (1821-1895) in 1857.† Matrices arose with Cayley in connection with
linear transformations of the type

x' = ax + by,
y' = cx + dy,

where a, b, c, d are real numbers, and which may be thought of as mapping
the point (x, y) into the point (x', y'). Clearly, the above transformation is
completely determined by the four coefficients a, b, c, d, and so the
transformation can be symbolized by the square array

$$\begin{bmatrix} a & b \\ c & d \end{bmatrix},$$

which we shall call a (square) matrix (of order 2). Since two transformations
of the kind under consideration are identical if and only if they possess the
same coefficients, we define two matrices

$$\begin{bmatrix} a & b \\ c & d \end{bmatrix} \quad\text{and}\quad \begin{bmatrix} e & f \\ g & h \end{bmatrix}$$

to be equal if and only if a = e, b = f, c = g, d = h. If the transformation
given above is followed by the transformation

x'' = ex' + fy',
y'' = gx' + hy',

the result can be shown, by elementary algebra, to be the transformation

x'' = (ea + fc)x + (eb + fd)y,
y'' = (ga + hc)x + (gb + hd)y.

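This motivates the following definition for the product of two matrices:

$$\begin{bmatrix} e & f \\ g & h \end{bmatrix}\begin{bmatrix} a & b \\ c & d \end{bmatrix} = \begin{bmatrix} ea + fc & eb + fd \\ ga + hc & gb + hd \end{bmatrix}.$$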
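The motivation for this definition, that the product matrix represents one transformation followed by another, can be illustrated with a short Python sketch. The coefficient values and sample points below are arbitrary, and the function names `apply` and `matmul` are assumptions made for the illustration.

```python
def apply(m, point):
    """Apply x' = ax + by, y' = cx + dy, where m = ((a, b), (c, d))."""
    (a, b), (c, d) = m
    x, y = point
    return (a * x + b * y, c * x + d * y)


def matmul(m2, m1):
    """Cayley product m2 m1: the transformation m1 followed by m2."""
    (e, f), (g, h) = m2
    (a, b), (c, d) = m1
    return ((e * a + f * c, e * b + f * d), (g * a + h * c, g * b + h * d))


# Arbitrary sample coefficients and points, chosen only for illustration.
M1 = ((1, 2), (3, 4))
M2 = ((0, -1), (5, 2))
POINTS = [(0, 0), (1, 0), (0, 1), (2, -3)]

product_matrix = matmul(M2, M1)

# Applying the product matrix agrees with applying M1 and then M2.
composition_matches = all(
    apply(product_matrix, p) == apply(M2, apply(M1, p)) for p in POINTS
)
```

Note the order of the factors: the matrix of the transformation performed second stands on the left.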

*Quoted by permission from Garrett Birkhoff and Saunders MacLane, A Survey of
Modern Algebra (New York: The Macmillan Company, 1941), p. 1.

†A. Cayley.
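Addition of matrices is defined by

$$\begin{bmatrix} a & b \\ c & d \end{bmatrix} + \begin{bmatrix} e & f \\ g & h \end{bmatrix} = \begin{bmatrix} a + e & b + f \\ c + g & d + h \end{bmatrix},$$

and, if m is any real number, we define

$$m\begin{bmatrix} a & b \\ c & d \end{bmatrix} = \begin{bmatrix} a & b \\ c & d \end{bmatrix} m = \begin{bmatrix} ma & mb \\ mc & md \end{bmatrix}.$$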

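In the resulting algebra of matrices, it may be shown that addition is both
commutative and associative and that multiplication is associative and
distributive over addition. But multiplication is not commutative, as is
shown by the simple example:

$$\begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}\begin{bmatrix} 0 & 0 \\ 1 & 0 \end{bmatrix} = \begin{bmatrix} 1 & 0 \\ 0 & 0 \end{bmatrix}, \qquad \begin{bmatrix} 0 & 0 \\ 1 & 0 \end{bmatrix}\begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix} = \begin{bmatrix} 0 & 0 \\ 0 & 1 \end{bmatrix}.$$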

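There is another law of common algebra, besides the commutative law
of multiplication, that is broken in Cayley’s matric algebra, and this is the
cancellation law for multiplication. Thus, although*

$$\begin{bmatrix} 1 & 0 \\ 0 & 0 \end{bmatrix}\begin{bmatrix} 0 & 1 \\ 0 & 1 \end{bmatrix} = \begin{bmatrix} 1 & 0 \\ 0 & 0 \end{bmatrix}\begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix} = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix},$$

we do not have

$$\begin{bmatrix} 0 & 1 \\ 0 & 1 \end{bmatrix} \quad\text{and}\quad \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}$$

equal to each other.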
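Both broken laws are easy to confirm by direct computation. The following Python sketch is only an illustration: the matrices are one concrete choice among many, and the names `matmul`, `P`, `Q`, `A`, `B`, `C` are hypothetical.

```python
def matmul(x, y):
    """Cayley product of two 2-by-2 matrices given as nested tuples."""
    return tuple(
        tuple(sum(x[r][k] * y[k][c] for k in range(2)) for c in range(2))
        for r in range(2)
    )


ZERO = ((0, 0), (0, 0))  # plays the role of zero in the matric algebra

# Noncommutativity: one concrete pair whose two products differ.
P = ((0, 1), (0, 0))
Q = ((0, 0), (1, 0))
pq = matmul(P, Q)
qp = matmul(Q, P)

# Failure of cancellation: A is not zero and AB = AC, yet B differs from C.
A = ((1, 0), (0, 0))
B = ((0, 1), (0, 1))
C = ((0, 1), (1, 0))
ab = matmul(A, B)
ac = matmul(A, C)
cancellation_fails = (A != ZERO) and (ab == ac) and (B != C)
```

The left factor A here has a second column of zeros, which is why it cannot distinguish between B and C: their second rows never enter the product.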

Although a number of algebras in which multiplication is noncommutative
were devised in the middle nineteenth century, algebras in which
multiplication is nonassociative are, for the most part, of rather recent origin.
As examples of such algebras we have Jordan algebras and Lie algebras. A
(special) Jordan algebra, which is used in quantum mechanics, has matrices
for elements, with equality and addition defined as in Cayley’s matric algebra,
but with the product of two matrices A and B defined as (AB + BA)/2,
where AB stands for Cayley’s product of the two matrices A and B. Although
multiplication in this algebra can be shown to be nonassociative, it is
obviously commutative. A Lie algebra differs from the above Jordan algebra
in that the product of the two matrices A and B is defined by AB − BA,
where again AB denotes the Cayley product of the matrices A and B. In
this algebra, multiplication is neither associative nor commutative.
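A small computation makes these claims concrete. The sketch below, in Python, uses exact fractions and three particular 2 by 2 matrices chosen for illustration; the function names `jordan` and `lie` are assumptions. It exhibits one triple for which each product fails to be associative, which is enough to show nonassociativity but says nothing about other triples.

```python
from fractions import Fraction


def matmul(x, y):
    """Cayley product of two 2-by-2 matrices given as nested tuples."""
    return tuple(
        tuple(sum(x[r][k] * y[k][c] for k in range(2)) for c in range(2))
        for r in range(2)
    )


def jordan(x, y):
    """Jordan product (XY + YX)/2, computed with exact fractions."""
    xy, yx = matmul(x, y), matmul(y, x)
    half = Fraction(1, 2)
    return tuple(
        tuple((xy[r][c] + yx[r][c]) * half for c in range(2)) for r in range(2)
    )


def lie(x, y):
    """Lie product XY - YX."""
    xy, yx = matmul(x, y), matmul(y, x)
    return tuple(tuple(xy[r][c] - yx[r][c] for c in range(2)) for r in range(2))


# Three sample matrices (an arbitrary illustrative choice).
A = ((0, 1), (0, 0))
B = ((0, 0), (1, 0))
C = ((1, 0), (0, 0))

jordan_commutes = jordan(A, B) == jordan(B, A)
jordan_associates = jordan(jordan(A, B), C) == jordan(A, jordan(B, C))
lie_commutes = lie(A, B) == lie(B, A)
lie_associates = lie(lie(A, B), C) == lie(A, lie(B, C))
```

For this triple the Jordan product commutes but does not associate, while the Lie product does neither; in fact the Lie product always satisfies AB − BA = −(BA − AB).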


*In Cayley’s matric algebra, the matrix

$$\begin{bmatrix} 0 & 0 \\ 0 & 0 \end{bmatrix}$$

plays the role of zero.
5.3  Groups

A group, which is one of the simplest algebraic structures of consequence,
is a nonempty set G of elements in which a binary operation * is defined
satisfying the following three postulates:

G1: For all a, b, c in G, (a * b) * c = a * (b * c).

G2: There exists an element i of G such that, for all a in G, a * i = a.
(The element i is called an identity element of the group. Later we shall prove
that a group possesses only one identity element.)

G3: For each element a of G there exists an element a⁻¹ of G such that
a * a⁻¹ = i. (The element a⁻¹ is called an inverse element of a. Later we shall
prove that an element a of a group possesses only one inverse element.)

If, in addition to the above three postulates, the following postulate is
satisfied, the group is called a commutative, or an Abelian, group.

G4: For all a, b in G, a * b = b * a.

A group for which Postulate G4 does not hold is called a non-Abelian
group. If the set G of a group contains only a finite number of distinct elements
the group is called a finite group; otherwise it is called an infinite group. For
some purposes the much simpler concept of a semigroup is important; it is
a set G of elements in which a binary operation * is defined satisfying the
single Postulate G1. If, in addition, Postulate G4 is satisfied, the semigroup
is called an Abelian semigroup. Illustrations of groups are numerous and
diverse, as the following examples testify.


EXAMPLES

(a)  Let  G  be  the  set  of  all  integers,  and  let  *  denote  ordinary  addition.  Here  the 
integer  0  is  the  identity  element,  and  the  inverse  element  of  a  given  integer  a 
is  its  negative.  This  is  an  example  of  an  infinite  Abelian  group. 

(b) Let G be the set of all rational numbers with 0 omitted, and let * denote ordinary
multiplication. Here the rational number 1 is the identity element, and the
inverse element of a given rational number a is its reciprocal 1/a. This is another
example of an infinite Abelian group.

(c) Let G be the set of all translations

T:  x' = x + h,  y' = y + k,

where h and k are real numbers, of the coordinate plane, and let T₂ * T₁ denote
the result of performing first translation T₁ and then translation T₂. If T₁ and T₂
are the translations

T₁:  x' = x + h₁,  y' = y + k₁,
T₂:  x' = x + h₂,  y' = y + k₂,

then it is easy to show that T₂ * T₁ is the transformation

x' = x + (h₁ + h₂),  y' = y + (k₁ + k₂),

which is again a translation. One can easily show that the operation * is
associative. The identity element is the translation in which h = k = 0, and the
inverse of translation T is the translation

T⁻¹:  x' = x − h,  y' = y − k.

This, too, is an example of an infinite Abelian group.

(d) Let G be the set of four numbers 1, −1, i, −i, where i² = −1, and let * denote
ordinary multiplication. Here 1 is the identity element, and the inverse elements
of 1, −1, i, −i are 1, −1, −i, i, respectively. This is an example of a finite
Abelian group.

(e)  Let  G  be  the  set  of  four  integers  1,  2,  3,  4,  and  let  a  *  b  denote  the  remainder 
obtained  by  dividing  the  ordinary  product  of  a  and  b  by  5.  We  may  represent 
all  possible  values  of  a  *  b  by  means  of  the  operation  table: 


 *  |  1   2   3   4
----+----------------
 1  |  1   2   3   4
 2  |  2   4   1   3
 3  |  3   1   4   2
 4  |  4   3   2   1

Here the value of a * b is found in the box common to the row headed by a
and the column headed by b. Note that 1 is the identity element, and that the
inverse elements of 1, 2, 3, 4 are 1, 3, 2, 4, respectively. This is another example
of a finite Abelian group. An operation table, like the above, can be made for
any finite group. If the table is symmetrical in its principal diagonal, as in the
present example, the group is Abelian; otherwise it is non-Abelian. The identity
element is the element that heads the column which exactly repeats the column
of row headings. To find the inverse of an element a, we merely travel across
the row headed by a until we come to the identity element; the element heading
the column we are now in is a⁻¹. There is no simple way of checking the
associativity of the operation * from the table.

(f) Let G be the set of all rotations of a wheel about its axis through angles which
are non-negative integral multiples of 60°, where a rotation of (6n + k) · 60°,
where n and k are non-negative integers with k < 6, is considered the same as
the rotation k · 60°. Let a * b denote the rotation b followed by the rotation a.
This is an example of a finite Abelian group.
(g) Let G be the set of all 2 by 2 matrices

$$\begin{bmatrix} a & b \\ c & d \end{bmatrix},$$

where a, b, c, d are rational numbers such that ad − bc ≠ 0, and let * denote
ordinary (Cayley) matric multiplication (see the last section). Here the matrix

$$\begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}$$

is the identity element, and the inverse element of matrix

$$\begin{bmatrix} a & b \\ c & d \end{bmatrix}$$

is the matrix

$$\begin{bmatrix} d/(ad - bc) & -b/(ad - bc) \\ -c/(ad - bc) & a/(ad - bc) \end{bmatrix}.$$

This is an example of an infinite non-Abelian group.

(h) Let G be the set of six expressions

r,  1/r,  1 − r,  1/(1 − r),  (r − 1)/r,  r/(r − 1),

and let a * b denote the result of substituting the expression b in place of r in
the expression a. For example,

(1/r) * (1 − r) = 1/(1 − r).

This is an example of a finite non-Abelian group.
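For the finite examples, Postulates G1 through G4 can be checked by brute force, including the associativity that cannot be read off an operation table at a glance. The Python sketch below is a minimal illustration under stated assumptions: `group_report` and `substitute` are hypothetical names, Example (e) is coded as multiplication modulo 5, and the six expressions of Example (h) are identified by their values at three sample arguments (3, 5, 7), which suffices here because each expression is a quotient of linear expressions in r.

```python
from fractions import Fraction
from itertools import product


def group_report(elements, op):
    """Brute-force check of Postulates G1-G4 on a finite set."""
    elements = list(elements)
    pairs = list(product(elements, repeat=2))
    closed = all(op(a, b) in elements for a, b in pairs)
    associative = all(
        op(op(a, b), c) == op(a, op(b, c))
        for a, b, c in product(elements, repeat=3)
    )
    right_identities = [e for e in elements if all(op(a, e) == a for a in elements)]
    identity = right_identities[0] if right_identities else None
    inverses = {a: b for a, b in pairs if op(a, b) == identity}
    abelian = all(op(a, b) == op(b, a) for a, b in pairs)
    return {
        "closed": closed,
        "associative": associative,
        "identity": identity,
        "inverses": inverses,
        "abelian": abelian,
    }


# Example (e): remainders of ordinary products on division by 5.
report_e = group_report([1, 2, 3, 4], lambda a, b: (a * b) % 5)

# Example (h): the six expressions, written as functions of r.
EXPRESSIONS = {
    "r": lambda r: r,
    "1/r": lambda r: 1 / r,
    "1-r": lambda r: 1 - r,
    "1/(1-r)": lambda r: 1 / (1 - r),
    "(r-1)/r": lambda r: (r - 1) / r,
    "r/(r-1)": lambda r: r / (r - 1),
}
SAMPLES = [Fraction(3), Fraction(5), Fraction(7)]  # arbitrary, avoiding 0 and 1


def fingerprint(f):
    """Values of f at the sample arguments, used to recognize an expression."""
    return tuple(f(x) for x in SAMPLES)


BY_FINGERPRINT = {fingerprint(f): name for name, f in EXPRESSIONS.items()}


def substitute(a, b):
    """a * b: put expression b in place of r in expression a."""
    fa, fb = EXPRESSIONS[a], EXPRESSIONS[b]
    # A KeyError here would mean the result is not one of the six expressions.
    return BY_FINGERPRINT[fingerprint(lambda r: fa(fb(r)))]


report_h = group_report(EXPRESSIONS, substitute)
```

The two reports differ exactly where the text says they should: `report_e["abelian"]` is true, while `report_h["abelian"]` is false, since for instance substituting 1 − r into 1/r does not give the same expression as substituting 1/r into 1 − r.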

Several things are accomplished by the above exhibition of examples of
groups. First of all, the examples establish the existence of both finite and
infinite Abelian groups and of both finite and infinite non-Abelian groups.
Next, the existence of non-Abelian groups establishes the independence of
Postulate G4 from the other three postulates. Finally, the various
representations of Abelian groups establish the consistency of Postulates G1,
G2, G3, G4. Since some of these representations (see Examples (d), (e), (f))
contain only a finite number of objects, which can be explicitly exhibited
one by one and denoted by symbols, it follows that we have established
absolute consistency of the postulate set rather than just relative consistency.
This is the case for any postulate set for which a model containing only a
finite number of objects can be exhibited, as, for instance, in the case of
the postulates for projective geometry in Section 4.5, where a “finite” model
was exhibited. In all other cases the method of models can do no more
than reduce the consistency of one system to that of another. The important
matter of consistency of postulate sets will be considered more fully in the
next chapter.

Mathematicians have devised many postulate sets for a group, all of
which, of course, are equivalent to one another. Frequently more is assumed
in the postulates than need be. For example, it is quite common to assume,
in G2 and G3, the commutativity of i with every element of G and the
commutativity of each element a of G with its inverse element a⁻¹. Since it
seems to be aesthetically more satisfying to assume as little as possible, we
have adopted the weaker set of postulates and shall prove as theorems the
additional assumptions about commutativity mentioned above.* We now proceed
to do this; some of the proofs we shall leave as exercises for the reader.

Some  fundamental  theorems  of  groups 

Theorem 1. If a, b, c are in G and a * c = b * c, then a = b.

By G3 there exists c⁻¹. From a * c = b * c, we then have (a * c) * c⁻¹
= (b * c) * c⁻¹, or, by G1, a * (c * c⁻¹) = b * (c * c⁻¹). Employing G3,
we now have a * i = b * i, whence finally, by G2, a = b.

Theorem 2. For all a in G, i * a = a * i.

By G3 there exists a⁻¹. Hence, applying G1, G3, G2, G3 in turn, we
have (i * a) * a⁻¹ = i * (a * a⁻¹) = i * i = i = a * a⁻¹. By Theorem 1 we
then have i * a = a. But, by G2, we have a * i = a. It now follows that
i * a = a * i.

Theorem 3. A group has a unique identity element.

Let i and j be two identity elements for the group. Then, by G2 applied
to the identity element j, i * j = i. Also, by Theorem 2, i * j = j * i. But,
by G2 applied to the identity element i, j * i = j. It now follows that i = j.

Theorem 4. For each element a of G, a⁻¹ * a = a * a⁻¹.

By G1, G3, and Theorem 2, applied in turn, (a⁻¹ * a) * a⁻¹ =
a⁻¹ * (a * a⁻¹) = a⁻¹ * i = i * a⁻¹. Therefore, by Theorem 1, a⁻¹ * a = i.
But, by G3, a * a⁻¹ = i. It now follows that a⁻¹ * a = a * a⁻¹.

Theorem  5.  If  a,  b,  c  are  in  G  and  c  *  a  =  c  *  b,  then  a  =  b. 

Theorem  6.  Each  element  of  a  group  has  a  unique  inverse  element. 

Theorem 7. If a is in G, then (a⁻¹)⁻¹ = a.

Theorem 8. If a and b are in G, then there exist unique elements x and y
of G such that a * x = b and y * a = b.

*Actually, however, our postulate set for a group can itself be further weakened.
See, for example, E. V. Huntington [1].
5.4  The  Significance  of  Groups  in  Algebra  and  Geometry

Groups and semigroups have considerable significance in the foundations
of mathematics. We shall devote the present section to a brief treatment
of the fundamental role played by these structures in algebra and geometry.

The importance of groups and semigroups in algebra lies largely in the
fact that many algebraic systems are actually groups or semigroups with
respect to one or more of the binary operations of the systems. In other
words, many algebraic structures contain the group structure or the
semigroup structure within them as a substructure. Groups and semigroups are
like algebraic atoms from which many algebraic systems can be constructed.
These ideas are illustrated by the following alternative definitions* of ring,
commutative ring, ring with unity, integral domain, division ring, and field (see
Section 5.2).

A ring is a set S of elements for which two binary operations, here called
addition and multiplication, are defined such that (1) S is an Abelian group
under addition, with identity element called zero, (2) S is a semigroup under
multiplication, (3) the two distributive laws of multiplication over addition
hold.

A commutative ring is a ring in which the multiplicative semigroup is
Abelian.

A  ring  with  unity  is  a  ring  in  which  the  multiplicative  semigroup  has  an 
identity  element. 

An  integral  domain  is  a  ring  in  which  the  nonzero  elements  constitute  a 
subsemigroup  of  the  multiplicative  semigroup. 

A division ring (or sfield) is a ring of more than one element in which
the nonzero elements constitute a subgroup of the multiplicative semigroup.

A field is a division ring in which the multiplicative semigroup is Abelian.
Otherwise stated, a field is a set S of at least two elements for which two
binary operations, here called addition and multiplication, are defined such
that (1) the elements of S constitute an Abelian group under addition, (2) the
nonzero elements of S constitute an Abelian group under multiplication,
(3) multiplication is both right and left distributive over addition.†
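The three-part definition of a field lends itself to a direct test on small finite systems. As a hedged illustration, the Python sketch below applies it to the integers 0, 1, ..., n − 1 under addition and multiplication modulo n; this family of examples and the names `is_abelian_group` and `is_field_mod` are assumptions introduced here, although the nonzero elements for n = 5 are exactly the group of Example (e) in Section 5.3.

```python
from itertools import product


def is_abelian_group(elements, op):
    """Brute-force test of closure and Postulates G1-G4 on a finite set."""
    elements = list(elements)
    if not elements:
        return False
    pairs = list(product(elements, repeat=2))
    if any(op(a, b) not in elements for a, b in pairs):
        return False
    if any(op(op(a, b), c) != op(a, op(b, c))
           for a, b, c in product(elements, repeat=3)):
        return False
    if any(op(a, b) != op(b, a) for a, b in pairs):
        return False
    identities = [e for e in elements if all(op(a, e) == a for a in elements)]
    if not identities:
        return False
    e = identities[0]
    return all(any(op(a, b) == e for b in elements) for a in elements)


def is_field_mod(n):
    """Apply the three-part definition of a field to the integers modulo n."""
    if n < 2:
        return False  # a field must have at least two elements

    def add(a, b):
        return (a + b) % n

    def mul(a, b):
        return (a * b) % n

    s = list(range(n))
    nonzero = [a for a in s if a != 0]
    distributive = all(
        mul(a, add(b, c)) == add(mul(a, b), mul(a, c))
        and mul(add(b, c), a) == add(mul(b, a), mul(c, a))
        for a, b, c in product(s, repeat=3)
    )
    return (
        is_abelian_group(s, add)           # condition (1)
        and is_abelian_group(nonzero, mul)  # condition (2)
        and distributive                    # condition (3)
    )


fields_found = [n for n in range(1, 10) if is_field_mod(n)]
```

For n = 4 or n = 6 the test fails at condition (2), because the product of two nonzero elements can be zero and so the nonzero elements are not closed under multiplication.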

In  1872  Felix  Klein  made  a  stunning  application  of  groups  to  geometry, 
which  introduced  a  beautiful  order  into  the  then  existing  chaos  of  geometrical 
information. 

Klein’s application of groups to geometry depends upon the concept
of a nonsingular transformation of a set S of elements into itself, by which we
simply mean a correspondence under which each element of S corresponds
to a unique element of S, and each element of S is the correspondent of a
unique element of S. Such a correspondence is said to be one-to-one,* and may
be described as a permutation of the elements of S among themselves; it is a
special kind of unary operation defined on the set S. If, in a nonsingular
transformation T of a set S into itself, element a of S corresponds to element
b of S, we say that, under the transformation T, a is carried into b.

*These definitions may be found in N. Jacobson, Chap. 2.

†See A. H. Lightstone.

By the product, T₂T₁, of two nonsingular transformations T₁ and T₂
of a set S of elements into itself, we mean the resultant transformation
obtained by first performing transformation T₁ and then transformation T₂.
The product of two nonsingular transformations is not necessarily commutative,
as is instanced by taking T₁ to be a translation of a distance of one
unit in the direction of the positive x-axis applied to the set S of all points
in the (x,y)-plane, and T₂ a counterclockwise rotation of the set S through
90° about the origin of coordinates. Under T₂T₁ the point (1,0) is carried
into the point (0,2), whereas under T₁T₂ it is carried into the point (1,1).
But a product of nonsingular transformations is associative, for if T₁, T₂, T₃
are any three nonsingular transformations of a set S into itself, T₃(T₂T₁)
and (T₃T₂)T₁ both denote the resultant transformation obtained by first
performing T₁, then T₂, then T₃. This can be seen by following the fate
under these transformations of some arbitrary element a of S. Thus, suppose
T₁ carries element a into element b, T₂ carries element b into element c,
and T₃ carries element c into element d. Then T₂T₁ carries element a directly
into element c and T₃ carries element c into element d, whence T₃(T₂T₁)
carries element a directly into element d. On the other hand, T₁ carries
element a into element b and T₃T₂ carries element b directly into element d,
whence (T₃T₂)T₁ also carries element a directly into element d.
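
The translation-and-rotation example and the associativity argument can be checked numerically. The following is only a minimal sketch in Python. The function names `translate`, `rotate`, `reflect`, and `product` are our own and do not come from the text, and the reflection used as a third transformation is an assumed extra example. Here `product(second, first)` means "perform `first`, then `second`," in agreement with the convention for products adopted above.

```python
def translate(p):
    """Translation of one unit in the direction of the positive x-axis."""
    x, y = p
    return (x + 1, y)

def rotate(p):
    """Counterclockwise rotation through 90 degrees about the origin."""
    x, y = p
    return (-y, x)

def reflect(p):
    """Assumed third transformation: reflection in the x-axis."""
    x, y = p
    return (x, -y)

def product(second, first):
    """Return the transformation 'first, then second'."""
    return lambda p: second(first(p))

# Non-commutativity: translate-then-rotate differs from rotate-then-translate.
t2t1 = product(rotate, translate)
t1t2 = product(translate, rotate)
print(t2t1((1, 0)))  # (0, 2)
print(t1t2((1, 0)))  # (1, 1)

# Associativity: both groupings agree on every sample point.
left = product(reflect, product(rotate, translate))
right = product(product(reflect, rotate), translate)
samples = [(x, y) for x in range(-2, 3) for y in range(-2, 3)]
print(all(left(p) == right(p) for p in samples))  # True
```

A check on finitely many sample points is, of course, an illustration and not a proof; the proof is the element-chasing argument given in the text.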

Let T be any nonsingular transformation of a set S into itself which
carries each element a of S into its corresponding element b of S. The
transformation which undoes transformation T, by carrying each element
b of S back into its original element a of S, is called the inverse transformation
of transformation T, and is denoted by T⁻¹. The product of T and T⁻¹ is
a transformation which clearly leaves all elements of S unchanged; such
a transformation is called an identical transformation, and will be denoted
by I. We note that TI = T for all T.

We  may  now  prove  the  following  important  theorem: 

Theorem. A nonempty set Γ of nonsingular transformations of a set S into
itself constitutes a group under multiplication of transformations if (1) the product
of any two transformations of the set Γ is in the set Γ, (2) the inverse of any
transformation of the set Γ is in the set Γ.

To establish this theorem we first note that, since the product of any
two transformations in the set Γ is in the set Γ, multiplication of transformations
is a binary operation defined in the set Γ. This binary operation is
associative, as we have shown above. Also, if T is a transformation in Γ, then
T⁻¹ is in Γ. But TT⁻¹ = I, so that I is in Γ, and TI = T for all T of Γ. All three
postulates for a group are thus satisfied, and the theorem is established. Such a group
of nonsingular transformations is briefly referred to as a transformation group.

*More generally, if S and T are two sets, then by a one-to-one correspondence between
the elements of S and the elements of T we mean a collection of ordered pairs (s, t) such that
each element s of S occurs as a first element exactly once, and each element t of T occurs
as a second element exactly once.
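
The two conditions of the theorem are easy to test by machine when S is finite. The sketch below is an illustration only, under assumptions of our own: S is taken to be the four vertices of a square, labelled 0 to 3, and a transformation is stored as a tuple whose entry in position i is the vertex into which i is carried. The names `compose`, `inverse`, and `rotations` are hypothetical.

```python
from itertools import product as pairs

S = (0, 1, 2, 3)  # vertices of a square, labelled in order

def compose(second, first):
    """Product of two transformations: perform `first`, then `second`."""
    return tuple(second[first[i]] for i in S)

def inverse(t):
    """The transformation that undoes t."""
    inv = [0] * len(t)
    for i, image in enumerate(t):
        inv[image] = i
    return tuple(inv)

identity = (0, 1, 2, 3)
quarter_turn = (1, 2, 3, 0)  # carries vertex i into vertex i + 1 (mod 4)

# Collect the rotations of the square by repeated multiplication.
rotations = {identity}
t = quarter_turn
while t not in rotations:
    rotations.add(t)
    t = compose(quarter_turn, t)

# Conditions (1) and (2) of the theorem.
closed = all(compose(a, b) in rotations for a, b in pairs(rotations, repeat=2))
has_inverses = all(inverse(a) in rotations for a in rotations)
print(len(rotations), closed, has_inverses)  # 4 True True

# A set that fails condition (1): the square of the quarter turn is missing.
too_small = {identity, quarter_turn}
print(all(compose(a, b) in too_small for a, b in pairs(too_small, repeat=2)))  # False
```

The four rotations therefore constitute a transformation group, whereas the identity together with a single quarter turn does not.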

We  are  now  ready  to  give  Felix  Klein’s  famous  definition  of  a  geometry: 
A  geometry  is  the  study  of  those  properties  of  a  set  S  which  remain  invariant  when 
the elements of set S are subjected to the transformations of some transformation group Γ.
This very general concept of a geometry was developed by Felix Klein in
his celebrated Erlanger Programm,* an address delivered by him when he
accepted  a  chair  at  the  University  of  Erlangen  in  1872. 

To illustrate Klein’s definition of a geometry, let S be the set of all
points of an ordinary plane, and consider the set Γ of all transformations of
S compounded from translations, rotations, and reflections in lines. Since
the product of any two such transformations and the inverse of any such
transformation are also such transformations, it follows that Γ is a transformation
group. The resulting geometry is ordinary plane Euclidean metric
geometry. Since such properties as length, area, congruence, parallelism,
perpendicularity, similarity of figures, collinearity of points, and concurrence
of lines are invariant under the group Γ, these properties are studied in plane
Euclidean metric geometry. If, now, Γ is enlarged by including, together
with the translations, rotations, and reflections in lines, the homothety
transformations (in which each point P is carried into a point P' such that
AP' = k · AP, where A is some fixed point, k is some fixed positive constant,
and A, P, P' are collinear), we obtain plane similarity, or plane equiform,
geometry. Under this enlarged group such properties as length, area, and
congruence no longer remain invariant, and hence are no longer subjects
of study, but parallelism, perpendicularity, similarity of figures, collinearity
of points, and concurrence of lines are still invariant properties, and hence
do constitute subject matter for study in this geometry. Considered from
Klein’s point of view, plane projective geometry is the study of those properties
of the points of a plane which remain invariant when the points are
subjected to the group of so-called projective transformations. Of the previously
mentioned properties, only collinearity of points and concurrence
of lines still remain invariant. An important invariant under this group of
transformations is the cross ratio of four collinear points; this invariant plays
an important role in the study of projective geometry. The plane non-Euclidean
metric geometries, considered in earlier chapters, can be thought
of as the study of those properties of the points of a non-Euclidean plane
which remain invariant under the group of transformations compounded
from translations, rotations, and reflections in lines.
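
The passage from Euclidean metric geometry to equiform geometry can be made concrete with a small computation. The following Python sketch is illustrative only. The particular isometry, the centre and ratio of the homothety, and the triangle are assumed values, and the helper names `isometry`, `homothety`, `dist2`, and `perpendicular` are hypothetical. Squared lengths are used so that all arithmetic stays in the integers.

```python
def isometry(p):
    """A 90-degree rotation about the origin followed by a translation."""
    x, y = p
    return (-y + 3, x - 1)

def homothety(p, k=2):
    """Homothety with centre A = (0, 0) and ratio k, so that AP' = k * AP."""
    x, y = p
    return (k * x, k * y)

def dist2(p, q):
    """Squared length of the segment pq."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2

def perpendicular(p, q, r):
    """True if the segments pq and pr are perpendicular (zero dot product)."""
    return (q[0] - p[0]) * (r[0] - p[0]) + (q[1] - p[1]) * (r[1] - p[1]) == 0

P, Q, R = (0, 0), (4, 0), (0, 3)  # a right triangle with its right angle at P

kept = {}
for name, f in [("isometry", isometry), ("homothety", homothety)]:
    P2, Q2, R2 = f(P), f(Q), f(R)
    # Record (length of PQ kept?, right angle at P kept?).
    kept[name] = (dist2(P2, Q2) == dist2(P, Q), perpendicular(P2, Q2, R2))
print(kept)
# {'isometry': (True, True), 'homothety': (False, True)}
```

Length survives the isometry but not the homothety, while perpendicularity survives both, which is the behaviour the text describes.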


*F.  Klein  [1]. 
In all of the above geometries, the fundamental elements upon which
the transformations of some transformation group are made to act are points;
hence the above geometries are all examples of so-called point geometries.
There are, as one might expect, geometries in which entities other than points
are chosen for fundamental elements. Thus geometers have studied line
geometries, circle geometries, sphere geometries, and various other geometries.
In building up a geometry one is at liberty to choose, first of all,
the  fundamental  element  of  the  geometry  (point,  line,  circle,  etc.);  next, 
the  manifold  or  space  of  these  elements  (plane  of  points,  ordinary  space  of 
points,  spherical  surface  of  points,  plane  of  lines,  pencil  of  circles,  etc.); 
and  finally,  the  group  of  transformations  to  which  the  fundamental  elements 
are  to  be  subjected.  The  construction  of  a  new  geometry  becomes,  in  this 
way,  a  rather  simple  matter. 

Another  interesting  feature  is  the  way  in  which  some  geometries  embrace 
others.  Thus,  since  the  transformation  group  of  plane  Euclidean  metric 
geometry  is  a  subgroup  of  the  transformation  group  of  plane  equiform 
geometry,  it  follows  that  any  theorem  holding  in  the  latter  geometry  must 
hold  in  the  former.  From  this  point  of  view  it  can  be  shown  that  projective 
geometry  lies  within  each  of  the  former,  and  we  have  a  sort  of  sequence  of 
nesting geometries. Until recent times, the transformation group of projective
geometry contained as subgroups the transformation groups of practically
all other geometries that had been studied. This is essentially what
Cayley  meant  when  he  remarked  that  “projective  geometry  contains  all 
geometry.”  Actually,  so  far  as  the  theorems  of  the  geometries  are  concerned, 
it  is  the  other  way  about — the  theorems  of  projective  geometry  are  contained 
among  the  theorems  of  each  of  the  other  geometries. 

The various geometries when considered from Klein’s point of view
can perhaps best be studied by the methods of analytic geometry, where
the transformations of each underlying transformation group Γ are given
by formulas connecting the new and the old coordinates of the fundamental
elements of the space under consideration. For example, the transformation
group of plane projective point geometry is the totality of all transformations
of the form

\[
x' = \frac{ax + by + c}{gx + hy + i}, \qquad
y' = \frac{dx + ey + f}{gx + hy + i},
\]

where a, b, c, ..., i are any real numbers satisfying the condition

\[
\begin{vmatrix} a & b & c \\ d & e & f \\ g & h & i \end{vmatrix} \neq 0.
\]
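
The invariance of the cross ratio under such a transformation can be illustrated with exact rational arithmetic. The sketch below is a check on one assumed example, not a proof. The coefficient values, the four collinear points, and the names `det3`, `project`, and `cross_ratio` are our own. The cross ratio of four collinear points A, B, C, D is taken here as (AC/BC)/(AD/BD), with signed ratios measured along the line.

```python
from fractions import Fraction

# Assumed coefficients a, b, c / d, e, f / g, h, i of one projective transformation.
M = [[2, 1, 1],
     [1, 3, 0],
     [1, 0, 1]]

def det3(m):
    """Determinant of the 3-by-3 array of coefficients."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def project(p, m=M):
    """x' = (ax + by + c)/(gx + hy + i), y' = (dx + ey + f)/(gx + hy + i)."""
    (a, b, c), (d, e, f), (g, h, i) = m
    x, y = p
    w = g * x + h * y + i
    return (Fraction(a * x + b * y + c, w), Fraction(d * x + e * y + f, w))

def cross_ratio(A, B, C, D):
    """(AC/BC)/(AD/BD) for four collinear points, using signed ratios."""
    k = 0 if A[0] != B[0] else 1  # pick a coordinate that varies along the line
    return Fraction(C[k] - A[k], C[k] - B[k]) / Fraction(D[k] - A[k], D[k] - B[k])

points = [(0, 1), (1, 2), (2, 3), (3, 4)]  # four points on the line y = x + 1
images = [project(p) for p in points]

print(det3(M))               # 2, so the condition on the determinant holds
print(cross_ratio(*points))  # 4/3
print(cross_ratio(*images))  # 4/3
```

The distances between consecutive points change under the transformation, but the cross ratio of the four points does not.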
If Γ is taken as the totality of all transformations of the form

\[
x' = f(x, y), \qquad y' = g(x, y),
\]

where f(x,y) and g(x,y) are continuous and single-valued, and where the
inverse transformation exists and is also continuous and single-valued, we
obtain the rather recently developed branch of geometry known as topology,
or analysis situs, of the Euclidean plane. From this definition one can see why
topology of the plane is frequently referred to as “rubber-sheet” geometry,
for in stretching or contracting a rubber sheet, the points of the sheet undergo
just such a bicontinuous single-valued transformation.

Topology  is  perhaps  the  deepest  geometry  in  the  sense  considered  above; 
its  transformation  group  embraces  all  previous  transformation  groups 
mentioned,  and  any  theorem  of  topology  thus  holds  within  these  other 
geometries.  Since  the  topological  transformations  are  so  broad,  it  is  fair 
to  wonder  what  properties  of  the  plane  of  points,  say,  can  possibly  remain 
invariant  under  these  transformations.  To  give  a  few  simple  examples,  we 
might  mention,  first  of  all,  that  a  simple  closed  curve  (a  closed  curve  which 
does  not  cut  itself)  remains  a  simple  closed  curve  under  all  topological 
transformations.  Again,  the  fact  that  the  deletion  of  only  one  point  from  a 
simple  closed  curve  does  not  disconnect  the  curve  is  a  topological  property, 
and  the  fact  that  the  deletion  of  two  points  from  a  simple  closed  curve 
separates  the  curve  into  two  pieces  is  also  a  topological  property.  Frequent 
use  is  made  of  such  topological  properties  in  modern  treatments  of  function 
theory. 

A deeper and more rigorous consideration of Klein’s remarkable synthesis
of geometries must be reserved for a course in the foundations of
geometry.  The  concept  is  very  useful  when  it  applies,  and  many  geometries 
can  be  considered  neatly  in  this  way.  But  shortly  after  the  turn  of  the  century, 
bodies  of  mathematical  propositions,  which  mathematicians  felt  should  be 
called  geometries,  came  to  light  which  cannot  be  fitted  into  this  codification, 
and  a  new  point  of  view  upon  the  matter  was  developed  based  upon  the 
idea  of  abstract  space  with  a  superimposed  structure  that  may  or  may  not 
be  definable  in  terms  of  some  transformation  group.  We  shall  return  to  this 
point  of  view  in  a  later  chapter,  and  here  close  our  present  discussion  with 
the  remark  that  these  new  geometries  have  found  application  in  the  modern 
theory  of  physical  space  that  is  incorporated  in  Einstein’s  general  theory 
of  relativity. 

5.5  Relations 

We  have  seen  that  the  postulates  of  a  postulational  discourse  are 
primary,  or  assumed,  statements  about  a  set  of  primitive  terms,  the  term 
primary  being  employed  to  indicate  that  the  statements  are  not  derived 
logically from other statements within the discourse. The primitive terms
may consist of unspecified relations connecting unspecified elements, as is
the situation in Hilbert’s postulational treatment of plane Euclidean geometry,
where the primitive elements are points and lines, and the primitive
relations connecting the elements are those denoted by on, between, and
congruent; or the primitive terms may consist of unspecified operations on
unspecified elements, as is the case, for example, in the postulate set for a
field; or, again, the primitive terms may consist of all three kinds: elements,
relations  connecting  the  elements,  and  operations  on  the  elements.  Much 
of  the  present  chapter  has  been  devoted  to  a  consideration  of  operations. 
Thus  we  have  classified  operations  into  unary,  binary,  ternary,  and  so  on, 
and  we  have  described  those  properties  of  binary  operations  known  as 
commutativity,  associativity,  and  distributivity  of  one  operation  over  another. 
So  far,  however,  we  have  said  nothing  of  a  similar  nature  about  relations. 
This  is  an  important  matter,  and  it  will  now  receive  our  attention. 

Everyone, of course, is familiar with the idea of a relation as a form of
connectivity between two or more things, for it is a concept which is not
peculiar to mathematics but permeates everyday life and conversation.
Thus it is common to hear of such relations as “is the father of,” “is a cousin
of,” “is married to,” “admires,” “is to the left of,” “is above,” “is smaller
than,” “is taller than,” “is heavier than,” “is the same color as,” “implies,”
“is between,” and so on. Elementary geometry is interested in such relations
as “is similar to,” “coincides with,” “is congruent to,” “is parallel to,”
“is perpendicular to,” and “is collinear with,” and the arithmetic of numbers
is concerned with such relations as “is equal to,” “is greater than,” “is less
than,” “is the sum of,” and “is the product of.”

Relations, like operations, can be classified according to the number of
elements involved. Here, however, it is customary to use Greek derivatives
instead of Latin derivatives, and we call a relation connecting two elements
a dyadic relation, and one connecting three elements a triadic relation.
Similarly, there are tetradic, pentadic, and higher polyadic relations. The most
commonly occurring relations, in both general and mathematical use, are
dyadic relations, as the above list exemplifies. The relation is between is,
however, an example of a triadic relation, for it is always used in the form
“a is between b and c.” Other triadic relations appearing in the above list
are “is collinear with,” “is the sum of,” and “is the product of,” when used
in the forms “a is collinear with b and c,” “a is the sum of b and c,” and
“a is the product of b and c.”

Although relations are always used in connection with terms, we can
consider the general properties of relations independently of the particular
terms related. We shall limit ourselves to only the commonest properties
of dyadic relations, namely, reflexiveness, irreflexiveness, nonreflexiveness, symmetry,
asymmetry, nonsymmetry, transitivity, intransitivity, and nontransitivity. Since, in
our consideration we are not interested in particular dyadic relations but in
their possible properties, we shall symbolize any dyadic relation whatsoever
by R and the elements that it connects by a, b, c, and so on. If a is related
to b by the R relation, we shall write a R b; if a is not related to b by the
R relation, we shall write a R̸ b.

A  dyadic  relation  R  is  said  to  be  reflexive  in  a  set  S  of  elements  if,  for 
each  element  a  of  S,  it  must  be  true  that  a  R  a.  For  example,  the  relation 
“is  equal  to,”  when  applied  to  numbers,  is  reflexive,  since  a  =  a  for  every 
number  a.  But  the  relation  “is  the  father  of,”  applied  to  people,  is  not 
reflexive,  since  no  person  is  the  father  of  himself.  A  dyadic  relation  R  is 
said to be irreflexive in a set S of elements if, for each element a of S, it is
true that a R̸ a. The relation “is greater than,” when applied to numbers,
is obviously irreflexive. A dyadic relation R is said to be nonreflexive in a set S
if for some a in S we have a R a and for some b in S we have b R̸ b. The
relation  “admires,”  applied  to  people,  is  nonreflexive,  for  if  a  is  a  person, 
a  may  or  may  not  admire  himself.  Likewise,  “is  the  square  of,”  applied  to 
real  numbers,  is  nonreflexive,  for  0  is  the  square  of  0,  but  2  is  not  the 
square  of  2. 

A dyadic relation R is said to be symmetric in a set S if whenever a R b
for a and b in S, we also have b R a. For example, the dyadic relations “is
equal to,” applied to numbers, and “is married to,” applied to people, are
symmetric relations. For if a and b are two numbers and if a = b, then
b = a; also if a and b are two people and a is married to b, then b is married to a.
A dyadic relation R is said to be asymmetric in a set S if whenever a R b for
a and b in S, we have b R̸ a; that is, when a R b is incompatible with b R a.
The relations “is greater than,” applied to numbers, and “is the father of,”
applied to people, are examples of asymmetric relations. For if a and b are
two numbers and if a > b, then we cannot have b > a; also, if a and b
are two people and if a is the father of b, then we cannot have that b is the
father of a. A dyadic relation R is said to be nonsymmetric in a set S if for some
a, b in S we have both a R b and b R a, and for some c, d in S we have both
c R d and d R̸ c. Thus the dyadic relation R is nonsymmetric if a R b is
compatible with, but does not imply, b R a. As examples of nonsymmetric
dyadic relations we have “is a brother of” and “admires,” when applied to
people, and “implies,” when applied to propositions. If a is a brother of
b, b may or may not be a brother of a; if John is a brother of Jane, then
Jane is not a brother of John, but if John is a brother of James, then James
is a brother of John. If a admires b, b may or may not admire a. Finally,
if proposition a implies proposition b, proposition b may or may not imply
proposition a. An arithmetical example of a nonsymmetric relation is the
relation “is not greater than,” applied to numbers, for if a and b are numbers
and if a is not greater than b, then b may or may not be not greater than a.

A dyadic relation R is said to be transitive in a set S if whenever a R b
and b R c, for a, b, c in S, we also have a R c. Examples of transitive relations
are “is equal to” and “is greater than,” applied to numbers, “is included in,”
applied to classes, and “implies,” applied to propositions. If a, b, c are
numbers and if a = b and b = c, then a = c, and if a > b and b > c, then
a > c. If a, b, c are classes of things, and if class a is included in class b and
class b is included in class c, then class a is included in class c. Finally, if
a, b, c are propositions, and if a implies b and b implies c, then a implies c.
The relationship of class inclusion is fundamental in the construction of
syllogisms, and the relationship of propositional implication is fundamental
in the establishment of a deductive chain of argument. Since both of these
latter relations are transitive, it is no great exaggeration to say that the
structure of transitivity is the mainspring of deductive reasoning. Also, it is
this property of transitivity that makes elimination possible; if a R b and
b R c, we can eliminate b and assert a R c. A dyadic relation R is said to be
intransitive in a set S if whenever a R b and b R c, for a, b, c in S, we have
a R̸ c. For example, the relation “is the father of,” applied to people, is
intransitive, for if a is the father of b and b is the father of c, then a cannot
be the father of c (a must be a grandfather of c). A dyadic relation R is
said to be nontransitive in a set S if for some a, b, c in S we have a R b, b R c,
a R c, and for some c, d, e in S we have c R d, d R e, c R̸ e. An example is
the dyadic relation “admires,” applied to people. If a admires b and b
admires c, then a may or may not admire c.

Note  that  the  properties  above  depend  not  only  upon  the  particular 
dyadic  relation  involved,  but  also  upon  the  set  of  elements  in  which  the 
relation  is  defined.  Thus  “is  the  square  of”  is  reflexive,  symmetric,  and 
transitive  in  the  set  consisting  of  the  numbers  0  and  1  only;  it  is  nonreflexive, 
symmetric, and transitive in the set consisting of 0, 1, and 2 only; it is nonreflexive,
nonsymmetric, and nontransitive in the set of all real numbers;
it is irreflexive, asymmetric, and intransitive in the set consisting of all the
real numbers except 0 and 1.
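
For a finite set, the nine properties can be decided by simply running through all pairs and triples of elements. The following Python sketch is one possible way of doing so. The function name `classify` and the sample sets are our own choices, and the two finite samples standing in for "all real numbers" and "all real numbers except 0 and 1" only illustrate the behaviour claimed for those infinite sets and do not establish it. Note also the assumption, marked in the code, that a property holding vacuously is reported under its first name.

```python
from itertools import product

def classify(S, R):
    """Classify a dyadic relation R, given as a two-argument test, on a finite set S."""
    S = list(S)
    refl = [R(a, a) for a in S]
    # For every pair with a R b, record whether b R a.
    sym = [R(b, a) for a, b in product(S, repeat=2) if R(a, b)]
    # For every triple with a R b and b R c, record whether a R c.
    trans = [R(a, c) for a, b, c in product(S, repeat=3) if R(a, b) and R(b, c)]

    def pick(flags, always, never, mixed):
        # Assumption: an empty list of cases (a vacuous property) counts as `always`.
        if all(flags):
            return always
        if not any(flags):
            return never
        return mixed

    return (pick(refl, "reflexive", "irreflexive", "nonreflexive"),
            pick(sym, "symmetric", "asymmetric", "nonsymmetric"),
            pick(trans, "transitive", "intransitive", "nontransitive"))

def is_square_of(a, b):
    return a == b * b

print(classify([0, 1], is_square_of))
# ('reflexive', 'symmetric', 'transitive')
print(classify([0, 1, 2], is_square_of))
# ('nonreflexive', 'symmetric', 'transitive')
print(classify([0, 1, 2, 4, 16], is_square_of))   # sample of the real numbers
# ('nonreflexive', 'nonsymmetric', 'nontransitive')
print(classify([2, 4, 16], is_square_of))         # sample omitting 0 and 1
# ('irreflexive', 'asymmetric', 'intransitive')
```

The same function can be applied to any relation that can be tested on a finite set, which makes it a convenient aid when hunting for examples of the various combinations of properties.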

It should be clear that the three sets of properties (the reflexive set,
the symmetric set, and the transitive set) are totally independent of one
another. That is to say, a dyadic relation may at the same time possess any
one of the three reflexive properties, any one of the three symmetric properties,
and any one of the three transitive properties. For example, the
relation “is contemporaneous with,” when applied to people, is reflexive,
symmetric, and transitive; the relation “is an ancestor of,” applied to people,
is irreflexive, asymmetric, and transitive; the relation “is married to,” applied
to people, is irreflexive, symmetric, and intransitive; the relation “admires,”
applied to people, is nonreflexive, nonsymmetric, and nontransitive; the
relation “is the father of,” applied to people, is irreflexive, asymmetric, and
intransitive. Since there are three reflexive properties, three symmetric
properties, and three transitive properties, there are 27 possible triples of
these properties that dyadic relations may possess. It is an interesting exercise
to try to construct examples illustrating each of the 27 possibilities.

Of  the  above  27  possible  types  of  dyadic  relations,  there  is  one  which  is 
of  paramount  importance  in  mathematics.  This  is  the  so-called  equivalence 
relation ,  which  is  the  type  that  is  reflexive,  symmetric,  and  transitive.  In 
other  words,  a  dyadic  relation,  which  we  shall  here  denote  by  the  symbol  ~, 
is  called  an  equivalence  relation  in  a  set  S  of  elements  if  and  only  if  the 
following  postulates  hold: 

E1: If a is an element of S, then a ~ a.

E2:  If  a  and  b  are  elements  of  S  and  if  a  ~  b,  then  b  ~  a. 

E3:  If  a,  b,  c  are  elements  of  S  and  if  a  ~  b  and  b  ~  c,  then  a  ~  c. 

Examples of equivalence relations are abundant. Thus we have “is contemporaneous
with” and “is the same age as,” applied to people, “is identical
with,”  applied  to  any  set  of  elements,  “is  equal  to,”  applied  to  numbers, 
“is  similar  to,”  and  “is  congruent  to,”  applied  to  triangles  in  geometry,  and 
so  on.  All  of  these  diverse  examples  have  the  common  algebraic  structure 
implied  by  the  above  postulate  set.  The  examples,  incidentally,  since  they 
are  interpretations  of  the  set  of  postulates,  establish  the  consistency  of  the 
postulate  set.  It  is  evident  that  the  concept  of  an  equivalence  relation  is  a 
generalization  of  the  equals  relation,  considered  in  the  sense  of  identity, 
which  we  have  used  frequently  in  the  preceding  sections  of  this  chapter. 

An equivalence relation ~ in a set S separates the elements of S into
classes. Thus all elements equivalent to some particular element a of S
belong in one class; all elements equivalent to some other element b of S,
not already in the first class, belong to a second class; and so on. It can be
shown that this process leads to a unique decomposition of S into nonoverlapping
classes. This separation of S into classes is called the class decomposition
of S corresponding to the equivalence relation ~.

As a simple example of the class decomposition of a set S corresponding
to an equivalence relation in S, consider the following. Let S be the set of
positive integers and let “a ~ b” mean “a differs from b by a multiple of 3.”
This is easily seen to be an equivalence relation if we keep in mind that 0
is a multiple of 3, namely, the 0th multiple. Decomposing the positive integers
into classes, by putting into the same class all pairs of positive integers that
differ by a multiple of 3, we obtain the three unique and nonoverlapping
classes: {1, 4, 7, ...}, {2, 5, 8, ...}, and {3, 6, 9, ...}.
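
The process of forming the classes, as described above, can be carried out mechanically. The short Python sketch below is illustrative only. The function name `class_decomposition` is hypothetical, and since the set of all positive integers cannot be listed, only the first nine are used.

```python
def class_decomposition(S, equivalent):
    """Separate the elements of S into classes of mutually equivalent elements."""
    classes = []
    for a in S:
        for cls in classes:
            # Transitivity lets us compare a with one representative of the class.
            if equivalent(a, cls[0]):
                cls.append(a)
                break
        else:
            classes.append([a])  # a belongs to no earlier class and starts a new one
    return classes

def differ_by_multiple_of_3(a, b):
    return (a - b) % 3 == 0

print(class_decomposition(range(1, 10), differ_by_multiple_of_3))
# [[1, 4, 7], [2, 5, 8], [3, 6, 9]]
```

Comparing each new element with a single representative of each class is legitimate only because the relation is symmetric and transitive; for a relation lacking those properties the procedure would not yield a well-defined decomposition.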

A  significant  application  of  the  decomposition  of  a  set  into  classes  by 
means  of  an  equivalence  relation  will  be  considered  in  the  next  chapter  when 
we take up the important concept of isomorphism.

With  this  we  bring  the  present  chapter  to  a  conclusion.  Its  purpose  was 
to  show  how  the  recognition  of  algebraic  structure,  in  the  early  half  of  the 
nineteenth  century,  led  to  the  liberation  of  algebra  from  simple  arithmetic 
and thence to the postulational treatment of many different algebras, so that
an  algebra,  like  a  geometry,  became  a  purely  hypothetico-deductive  study. 
At  the  same  time,  we  wished  to  introduce  the  reader  to  some  of  the  flavor 
and  terminology  of  modern  abstract  algebra.  Use  will  be  made  in  later 
chapters  of  many  of  the  algebraic  concepts  considered  in  this  chapter. 


PROBLEMS

5.1.1 Sometimes an n-ary operation on a set S of elements is defined, less
restrictively than in Section 5.1, as a rule which assigns to each ordered subset
of n elements of S a uniquely defined element of some set T; if T is S, or a
subset of S, then S is said to be closed under the operation. Thus the set of
all  even  integers  is  closed  under  the  binary  operation  of  addition,  for  the 
sum  of  two  even  integers  is  always  an  even  integer,  but  the  set  of  all  odd 
integers  is  not  closed  under  the  binary  operation  of  addition,  for  the  sum 
of  two  odd  integers  is  not  always  (in  fact,  is  never)  an  odd  integer.  Which 
of  the  following  sets  are  closed  under  the  given  operations? 

(a) The set of integers from 1 to 20, inclusive, under the binary operation
of addition.

(b) The set of all odd integers under the binary operation of multiplication.

(c)  The  set  of  all  positive  integers  under  the  binary  operation  of  division. 

(d)  The  set  of  all  positive  rational  numbers  under  the  binary  operation 
of  division. 

(e)  The  set  consisting  of  the  single  number  1  under  the  binary  operation 
of  multiplication. 

(f)  The  set  of  all  positive  rational  numbers  under  the  unary  operation 
of  taking  the  positive  square  root. 

(g)  The  set  of  all  positive  integers  under  the  binary  operation  of 
subtraction. 

(h) The set of all prime numbers p under the unary operation
p² − p + 41.

(i) The set of all integral multiples a, b, c, ... of 3 under the ternary
operation ab + c.

(j) The set of all odd integers a, b, c, ... under the ternary operation
ab + c + 1.

(k) The set of all gases at 70° Fahrenheit under the binary operation
of chemical combination of pairs of gases.

(l) The set of all colors under the binary operation of mixing pairs of
colors.

5.1.2 (a) Is the operation + distributive over the operation × in the set
of all integers?

(b) Is the operation + distributive over the operation × in Example
(k) of Section 5.1?
5.1.3 Two unary operations (defined for the same set) are said to be
commutative  with  respect  to  each  other  if  the  same  final  result  is  achieved  regardless 
of  the  order  of  performance  of  the  two  operations.  Which  of  the  following 
pairs  of  unary  operations  are  commutative  with  respect  to  one  another? 

(a)  (1)  Adding  3,  (2)  adding  5,  applied  to  the  set  of  positive  integers. 

(b)  (1)  Squaring,  (2)  cubing,  applied  to  the  set  of  positive  integers. 

(c)  (1)  Adding  3,  (2)  multiplying  by  5,  applied  to  the  set  of  positive 
integers. 

(d)  (1)  Taking  the  cosine,  (2)  taking  the  sine,  applied  to  the  set  of  real 
numbers. 

(e) (for students who have studied calculus) (1) dy/dx + ay, (2) dy/dx − ay,
applied to the set of analytic functions y of x.

(f)  (1)  Putting  on  a  pair  of  socks,  (2)  putting  on  a  pair  of  shoes,  applied 
to  the  set  of  a  person’s  appearances. 

(g) (1) Insuring an automobile, (2) colliding the automobile with that
of a struggling lawyer, applied to the set of legal demands on one’s
pocketbook.*

(h)  (1)  Shelling  an  egg,  (2)  scrambling  the  egg,  applied  to  the  set  of 
appetizing  foods. 

(i) (1) Disrobing, (2) taking a bath, applied to the set of the
appearances of one’s clothing.
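
As an illustrative aside, a claim that two unary operations commute can be spot-checked by machine. The sketch below is only a sample test under stated assumptions: the helper `commute_on` and the operations `double`, `triple`, and `add_one` are hypothetical names chosen for illustration and are not taken from the problems above. Agreement on a finite sample can refute commutativity but cannot prove it.

```python
from typing import Callable, Iterable


def commute_on(f: Callable[[int], int],
               g: Callable[[int], int],
               samples: Iterable[int]) -> bool:
    """Return True when f(g(x)) == g(f(x)) for every sampled x."""
    return all(f(g(x)) == g(f(x)) for x in samples)


# Hypothetical unary operations on the positive integers.
def double(x: int) -> int:
    return 2 * x


def triple(x: int) -> int:
    return 3 * x


def add_one(x: int) -> int:
    return x + 1


positives = range(1, 101)  # a finite sample of the positive integers

print(commute_on(double, triple, positives))   # both orders give 6x
print(commute_on(double, add_one, positives))  # 2(x + 1) differs from 2x + 1
```

A result of False settles the question by exhibiting a counterexample; a result of True only suggests that a proof is worth attempting.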

5.1.4 Reduce the left member of each of the following equalities to the
right member by using successively an associative, commutative, or distributive
law. Following custom, multiplication is here sometimes indicated
by a dot (·) and sometimes by mere juxtaposition of the factors.

(a) 5(6 + 3) = 3 · 5 + 5 · 6.

(b) 5(6 · 3) = (3 · 5)6.

(c) 4 · 6 + 5 · 4 = 4(5 + 6).

(d) a[b + (c + d)] = (ab + ac) + ad.

(e) a[b(cd)] = (bc)(ad).

(f) a[b(cd)] = (cd)(ab).

(g) (ad + ca) + ab = a[(b + c) + d].

(h) a + [b + (c + d)] = [(a + b) + c] + d.

5.1.5  (a)  through  (k)  Actually  show  that  the  examples  of  Section  5.1  satisfy 
the  five  basic  properties  of  that  section.  Which  of  the  examples  satisfy  the 
postulates  for  a  field?  for  an  ordered  field? 

5.1.6  Establish  the  following  theorems  for  any  field  S: 

(a) If a ⊕ z = a and a ⊕ z' = a for all elements a of S, then z = z'.
(This proves that the zero element is unique.)

(b) If a, b, c are elements of S and if a ⊕ b = a ⊕ c, then b = c. (This
is the (left) cancellation law for the operation ⊕.)

(c) Given two elements a and b of S, then there exists a unique element
x of S such that a ⊕ x = b. (This proves that “subtraction” is always possible
in a field.)


*R. P. Agnew, p. 98.

(d) If a is any element of S, then a ⊗ z = z.

(e)  If  a  and  b  are  any  two  elements  of  S,  then  a  (X)  b  =  a  &)tb. 

(f) If a and b are elements of S and if a ⊗ b = z, then either a = z
or b = z.

(g) Given two elements a and b of S, a ≠ z, then there exists a unique
element x of S such that a ⊗ x = b. (This proves that “division,” except by
zero, is always possible in a field.)

(h) Show that $a \otimes \bar{b} = \bar{a} \otimes b = \overline{a \otimes b}$.

5.1.7 Establish the following theorems for any ordered field S:

(a) If a is a positive element of S, then $\bar{a}$ is negative.

(b) Element a of S is positive if and only if a ⧁ z.

(c) Element a of S is negative if and only if a ⧀ z.

(d) If a and b are distinct elements of S, then either a ⧁ b or a ⧀ b,
but not both.

(e) If a, b, c are elements of S and if a ⧁ b and b ⧁ c, then a ⧁ c.

(f) If a ⧁ b and c is any element of S, then (a ⊕ c) ⧁ (b ⊕ c).

(g) If a ⧁ b and c is positive, then (a ⊗ c) ⧁ (b ⊗ c).

(h) If a and b are positive elements of S, then (1) $a \otimes \bar{b} = \bar{a} \otimes b$ = the
negative element $\overline{a \otimes b}$, (2) $\bar{a} \otimes \bar{b}$ = the positive element $a \otimes b$.

(i) If a ≠ z is an element of S, then a ⊗ a is positive.

(j)  The  element  u  is  positive. 

5.1.8 Show that any set of complex numbers, in which the sum, difference,
product, and quotient (with denominator different from 0) of any two numbers
in the set are again in the set, constitutes a field under addition and
multiplication.

5.2.1 Let S be the set of all ordered real number pairs and define equality,
addition, and multiplication of the number pairs by (1) (a,b) = (c,d) if and
only if a = c and b = d, (2) (a,b) + (c,d) = (a + c, b + d), (3) (a,b)(c,d) =
(ac − bd, ad + bc).

(a)  Show  that  addition  is  commutative  and  associative. 

(b)  Show  that  multiplication  is  commutative  and  associative. 

(c)  Show  that  multiplication  is  distributive  over  addition. 

(d)  Show,  in  fact,  that  the  number  pairs  constitute  a  field  under 
addition  and  multiplication. 
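
Before attempting the proofs, the pair arithmetic can be tried out numerically. The following is a minimal sketch, assuming integer components so that the arithmetic is exact; the names `add`, `mul`, and `sample` are illustrative and not part of the text. A finite check of this kind is evidence, not a proof.

```python
from itertools import product

Pair = tuple[int, int]


def add(p: Pair, q: Pair) -> Pair:
    """(a,b) + (c,d) = (a + c, b + d)."""
    return (p[0] + q[0], p[1] + q[1])


def mul(p: Pair, q: Pair) -> Pair:
    """(a,b)(c,d) = (ac - bd, ad + bc)."""
    a, b = p
    c, d = q
    return (a * c - b * d, a * d + b * c)


# A small grid of integer pairs; exact arithmetic avoids rounding questions.
sample = list(product(range(-2, 3), repeat=2))

commutative = all(mul(p, q) == mul(q, p)
                  for p, q in product(sample, repeat=2))
distributive = all(mul(p, add(q, r)) == add(mul(p, q), mul(p, r))
                   for p, q, r in product(sample, repeat=3))

print(commutative, distributive)
```

The proofs asked for above replace the finite grid by arbitrary real a, b, c, d and expand both sides symbolically.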

5.2.2  In  the  set  S  of  Problem  5.2.1,  with  the  same  definitions  of  equality, 
addition,  and  multiplication,  show  that 

(a) (a,0) + (b,0) = (a + b, 0),

(b) (a,0)(b,0) = (ab,0),

(c) (0,b) = (b,0)(0,1),

(d) (0,1)(0,1) = (−1,0).

5.2.3 (a) Add the two quaternions (1, 0, −2, 3) and (1, 1, 2, −2).

(b) Multiply, in both orders, the two quaternions (1, 0, −2, 3) and
(1, 1, 2, −2).

5.2.4 (a) Show that addition of quaternions is commutative and associative.

(b) Show that multiplication of quaternions is associative and distributive
over addition.

(c)  Show  that  the  real  numbers  are  embedded  within  the  quaternions. 

(d)  Show  that  the  complex  numbers  are  embedded  within  the 
quaternions. 

5.2.5  (a)  Verify  the  multiplication  table  for  quaternionic  units  given  in 
Section  5.2. 

(b) Multiply the two quaternions a + bi + cj + dk and e + fi + gj + hk
like polynomials in i, j, k, and, by means of the multiplication table for the
quaternionic units, verify that the result agrees with the defined product of
the two quaternions.

5.2.6  Show  that  any  set  of  complex  numbers,  in  which  the  difference 
and  product  of  any  two  numbers  in  the  set  are  again  in  the  set,  constitutes 
a  commutative  ring  under  addition  and  multiplication. 

5.2.7 Show that the set of even integers, under ordinary addition and
multiplication, constitutes an integral domain without a unity.

5.2.8 Show that the set of all integers, under ordinary addition and
multiplication, constitutes an integral domain but does not constitute a field.

5.2.9  Show  that  Example  (i)  of  Section  5.1  is  not  an  example  of  an  integral 
domain. 

5.2.10 If

x' = ax + by, x" = ex' + fy',
y' = cx + dy, y" = gx' + hy',

show that

x" = (ea + fc)x + (eb + fd)y,
y" = (ga + hc)x + (gb + hd)y.

5.2.11 Given the matrices

$$A = \begin{bmatrix} 2 & -3 \\ 4 & 1 \end{bmatrix}, \quad B = \begin{bmatrix} -2 & 2 \\ 0 & 3 \end{bmatrix},$$

calculate A + B, AB, BA, and $A^2$.
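
The successive substitutions of Problem 5.2.10 and the matrix products asked for here are one computation seen two ways: performing the substitution with coefficients a, b, c, d and then the one with coefficients e, f, g, h has the same effect as a single substitution whose coefficients are the entries of the product of the second coefficient matrix by the first. The sketch below illustrates this on sample values; the helpers `matmul2` and `apply` and the matrices `first` and `second` are hypothetical and deliberately different from A and B above.

```python
Matrix = list[list[int]]


def matmul2(m: Matrix, n: Matrix) -> Matrix:
    """Row-by-column product of two 2 by 2 matrices."""
    return [[sum(m[i][k] * n[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def apply(m: Matrix, point: tuple[int, int]) -> tuple[int, int]:
    """Apply the substitution x' = m00*x + m01*y, y' = m10*x + m11*y."""
    x, y = point
    return (m[0][0] * x + m[0][1] * y, m[1][0] * x + m[1][1] * y)


first = [[1, 2], [3, 4]]    # plays the role of a, b / c, d
second = [[5, 6], [7, 8]]   # plays the role of e, f / g, h
point = (2, -1)

two_steps = apply(second, apply(first, point))
one_step = apply(matmul2(second, first), point)

print(two_steps == one_step)
```

Note the order: the substitution performed first appears as the right-hand factor of the product.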


5.2.12 Show that the matrix

$$\begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}$$

has no square root.

5.2.13 Show that the set of all 2 by 2 matrices with real elements constitutes
a noncommutative ring with unity under the usual addition and multiplication
of matrices.


5.2.14 Show that we may define complex numbers as matrices of the form

$$\begin{bmatrix} a & b \\ -b & a \end{bmatrix},$$

where a and b are real, subject to the usual definitions of addition and
multiplication of matrices.


5.2.15 Show that we may define real quaternions as matrices of the form

$$\begin{bmatrix} a + bi & c + di \\ -c + di & a - bi \end{bmatrix},$$

where a, b, c, d are real and $i^2 = -1$, subject to the usual definitions of
addition and multiplication of matrices.


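5.2.16 Taking

$$A = \begin{bmatrix} 1 & 0 \\ -1 & 0 \end{bmatrix}, \quad B = \begin{bmatrix} 1 & 1 \\ -1 & 1 \end{bmatrix}, \quad C = \begin{bmatrix} 1 & 1 \\ 0 & 1 \end{bmatrix}$$

as elements of a Jordan algebra, calculate A + B, AB, BA, A(BC), and (AB)C.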

5.2.17  Taking  A,  B,  C  of  Problem  5.2.16  as  elements  of  a  Lie  algebra, 
calculate  A  +  B,  AB,  BA,  A(BC),  and  (AB)C. 

5.2.18 Determine whether the following binary operations * and |, defined
for positive integers, obey the commutative and associative laws, and whether
the operation | is distributive over the operation *:

(a) $a * b = a + 2b$, $a \mid b = 2ab$.

(b) $a * b = a + b^2$, $a \mid b = ab^2$.

(c) $a * b = a^2 + b^2$, $a \mid b = a^2b^2$.

(d) $a * b = ab$, $a \mid b = b$.

5.2.19 Let S be the set of all ordered real number pairs and define: (1)
(a,b) = (c,d) if and only if a = c and b = d; (2) (a,b) + (c,d) = (a + c,
b + d); (3) (a,b)(c,d) = (0,ac); (4) k(a,b) = (ka,kb).

(a) Show that multiplication is commutative, associative, and distributive
over addition.

(b) Show that the product of three or more factors is always equal
to (0,0).

(c) Construct a multiplication table for the units u = (1,0) and
v = (0,1).

5.2.20  Hamilton’s  quaternions  and,  to  some  extent,  Grassmann’s  calculus 
of  extension  were  devised  by  their  creators  as  mathematical  tools  for  the 
exploration  of  physical  space.  These  tools  proved  to  be  too  complicated  for 
quick  mastery  and  easy  application,  but  from  them  emerged  the  much  more 
easily  learned  and  more  easily  applied  subject  of  vector  analysis.  This  work 
was due principally to the American physicist Josiah Willard Gibbs
(1839-1903), and is encountered by every student of elementary physics. In
elementary  physics,  a  vector  is  graphically  regarded  as  a  directed  line 
segment,  or  arrow,  and  the  following  definitions  of  equality,  addition,  and 
multiplication  of  these  vectors  are  made: 

(1)  Two  vectors  a  and  b  are  equal  if  and  only  if  they  have  the  same 
length  and  the  same  direction. 

(2) Let a and b be any two vectors. Through a point in space draw
vectors a' and b' equal, respectively, to vectors a and b, and complete the
parallelogram determined by a' and b'. Then the sum, a + b, of vectors
a and b is a vector whose length and direction are those of the diagonal
running from the common origin of a' and b' to the fourth vertex of the
parallelogram.

(3) Let a and b be any two vectors. By the vector product, a × b, of these
two vectors is meant a vector whose length is numerically equal to the area
of the parallelogram in definition (2), and whose direction is that of the
progress of an ordinary screw when placed perpendicular to both a' and b'
and twisted through the angle of not more than 180° which will carry
vector a' into vector b'.

(a)  Show  that  vector  addition  is  commutative  and  associative. 

(b) Show that vector multiplication is noncommutative and
nonassociative.

(c)  Show  that  vector  multiplication  is  distributive  over  vector  addition. 

5.2.21  Show  that,  in  the  definition  of  an  integral  domain,  Postulate  P9 
may  be  replaced  by 

P9': If a and b are in S and if a ≠ z and b ≠ z, then a ⊗ b ≠ z.

5.2.22 Let capital letters P, Q, R, ... denote points of the plane. Define
addition of points P and Q by P + Q = R, where triangle PQR is a
counterclockwise equilateral triangle.

(a) Show that the addition of points of the plane is noncommutative
and nonassociative.

(b) Show that if P + Q = R, then Q + R = P.

(c) Establish the following identities:

(1) (P + (P + (P + (P + (P + (P + Q)))))) = Q,

(2) P + (P + (P + Q)) = (Q + P) + (P + Q),

(3) (P + Q) + R = (P + (Q + R)) + Q.
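
One convenient way to experiment with this addition of points is to model the plane by the complex numbers. This model is an assumption introduced here for illustration and is not part of the problem: if PQR is a counterclockwise equilateral triangle, then R is obtained by rotating Q about P through +60°, that is, R = P + ω(Q − P) with ω = cos 60° + i sin 60°. The names `padd` and `close` below are hypothetical, and the comparisons use a small tolerance because the arithmetic is in floating point.

```python
import cmath

OMEGA = cmath.exp(1j * cmath.pi / 3)  # rotation through +60 degrees


def padd(p: complex, q: complex) -> complex:
    """Return the point R for which PQR is a counterclockwise equilateral triangle."""
    return p + OMEGA * (q - p)


def close(u: complex, v: complex, tol: float = 1e-9) -> bool:
    """Compare two points up to floating-point rounding."""
    return abs(u - v) < tol


P, Q = 0 + 0j, 1 + 0j
R = padd(P, Q)

print(close(padd(Q, R), P))           # the situation of part (b)
print(close(padd(P, Q), padd(Q, P)))  # the two orders give different points
```

Numerical agreement at sample points is only a plausibility check; the identities themselves are to be established geometrically or algebraically.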

5.3.1 Show that the set of two numbers, 1, −1, under ordinary multiplication,
constitutes a group which is a subgroup of that of Example (d) of
Section 5.3.

5.3.2  (a)  Do  the  even  integers  form  a  group  with  respect  to  addition? 

(b)  Do  the  odd  integers  form  a  group  with  respect  to  addition? 

(c) Do all the rational numbers form a group with respect to
multiplication?

(d)  Let  a  *  b  =  a  —  b,  where  a  and  b  are  integers.  Do  the  integers 
form  a  group  with  respect  to  this  operation? 

(e)  Do  all  the  integral  multiples  of  3  form  a  group  with  respect  to 
addition? 

5.3.3 Let G be the set of all rotations

R: x' = x cos θ − y sin θ,
   y' = x sin θ + y cos θ,

of the plane about the origin, and let $R_2 * R_1$ denote the result of performing
first rotation $R_1$ and then rotation $R_2$. Show that G, under the operation *,
constitutes an infinite Abelian group.

5.3.4 Construct the operation table for Example (d) of Section 5.3.

5.3.5 Let G be the set of five integers 0, 1, 2, 3, 4, and let a * b denote
the  remainder  obtained  by  dividing  the  ordinary  product  of  a  and  b  by  5. 
Does  G,  under  the  operation  *  ,  constitute  a  group? 
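
For a finite set, questions of this kind can be explored by brute force. The sketch below is a generic checker written under stated assumptions: the function name `is_group` is hypothetical, the group postulates are taken in their usual form (closure, associativity, a two-sided identity, two-sided inverses), and the demonstration deliberately uses addition and subtraction rather than the operation of this problem.

```python
from itertools import product
from typing import Callable, Iterable


def is_group(elements: Iterable[int], op: Callable[[int, int], int]) -> bool:
    """Check the usual group postulates on a finite set by exhaustion."""
    elems = list(elements)
    members = set(elems)
    # Closure: every result must lie in the set.
    if any(op(a, b) not in members for a, b in product(elems, repeat=2)):
        return False
    # Associativity: (a * b) * c == a * (b * c) for all triples.
    if any(op(op(a, b), c) != op(a, op(b, c))
           for a, b, c in product(elems, repeat=3)):
        return False
    # Identity: some e with e * a == a * e == a for every a.
    identities = [e for e in elems
                  if all(op(e, a) == a and op(a, e) == a for a in elems)]
    if not identities:
        return False
    e = identities[0]
    # Inverses: every a has some b with a * b == b * a == e.
    return all(any(op(a, b) == e and op(b, a) == e for b in elems)
               for a in elems)


five = range(5)
print(is_group(five, lambda a, b: (a + b) % 5))  # remainders under addition
print(is_group(five, lambda a, b: a - b))        # ordinary subtraction leaves the set
```

When the checker reports False, inspecting which postulate failed, and for which elements, usually shows how to answer the question by hand.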

5.3.6  (a)  Actually  show  that  Example  (g)  of  Section  5.3  constitutes  an 
infinite  non-Abelian  group. 


(b) Calculate the inverse $A^{-1}$ of matrix

$$A = \begin{bmatrix} 2 & 1 \\ 3 & -1 \end{bmatrix}$$

and show that the product $AA^{-1}$ is the identity matrix.

5.3.7  (a)  Form  the  operation  table  for  the  group  of  Example  (h)  of 
Section  5.3. 

(b)  We  have  defined  the  cross  ratio  of  four  collinear  points  A,  B,  C ,  D, 
taken  in  this  order,  to  be 


r = (AB,CD) =


The value of the cross ratio of four points evidently depends upon the order
in which we consider the points. There are, then, 24 cross ratios corresponding
to the 24 permutations (or orders) of the four points. Show that the
number of distinct cross ratios is only six and that they are given by the six
expressions of Example (h). For this reason the group of Example (h) is
known as the cross ratio group.

5.3.8 (a) Consider the ordered triple (a,b,c). The substitution of c for a,
a for b, and b for c, can be represented by the array


There are, in all, six possible substitutions (counting the identity substitution)
that can be made. If $S_1$ and $S_2$ denote any two of these six substitutions, let
$S_2 * S_1$ denote the result of substitution $S_1$ followed by substitution $S_2$. For
example,


Show that the six substitutions of the ordered triple (a,b,c), under the
operation *, constitute a finite non-Abelian group. This group is known as
the symmetric group of degree 3; there exists such a substitution group for each
positive integral degree n.

(b)  Can  you  find  some  subgroups  of  the  symmetric  group  of  degree  3? 

5.3.9  (a)  Prove  Theorem  5  of  Section  5.3. 

(b)  Prove  Theorem  6  of  Section  5.3. 

(c)  Prove  Theorem  7  of  Section  5.3. 

(d) Prove Theorem 8 of Section 5.3.

5.3.10 Show that Postulates G2 and G3 for a group may be replaced by

G2': If a and b are any elements of G, then there exist elements x and y of G
such that a * x = b and y * a = b.

Show that the simpler assumption of just the existence of y such that
y * a = b is not sufficient.

5.3.11  Do  we  still  necessarily  have  a  group  if  Postulate  G3  for  a  group  is 
replaced  by  the  following? 

G3': For each element a of G there exists an element $a^{-1}$ of G such that
$a^{-1} * a = i$.

5.3.12  Of  increasing  importance  in  mathematics  are  nonassociative  systems, 
such  as  loops.  A  loop  is  a  set  L  of  elements  a,  b,  c,  •  •  •  along  with  a  binary 
operation  *  satisfying  the  following  two  postulates: 

L1: For each a and b in L, there are unique elements x and y in L such that
a  *  x  =  b  and  y  *  a  =  b. 

L2:  There  exists  an  element  e  in  L  such  that  a*e  =  e*a  =  a  for  every 
a  in  L. 

(a)  Show  that  the  identity  element  e  is  unique. 

(b) Show that a * x = a * y or x * a = y * a implies x = y.

Each of the following questions refers to a loop having the so-called
left inverse property; that is, for each x in L there is an element $x^{-1}$ in L such
that $x^{-1} * (x * y) = y$ for every y in L:

(c) Show that $x^{-1} * x = e$.

(d) Show that $x^{-1}$ is unique.

(e) Show that $x * x^{-1} = e$.

(f) Show that $(x^{-1})^{-1} = x$.

A loop has the right inverse property if, for each x in L, there is an element
$x^r$ in L such that $(y * x) * x^r = y$ for all y in L.

(g) Show that if both inverse properties hold, then $x^r = x^{-1}$.

(h) Show that if both inverse properties hold, then $(x * y)^{-1} = y^{-1} * x^{-1}$.

5.4.1 Show that the definitions of ring, commutative ring, ring with unity,
integral  domain,  division  ring,  and  field,  as  given  in  Section  5.4  in  terms  of 
groups  and  semigroups,  are  equivalent  to  the  definitions  of  these  algebraic 
structures  as  given  in  Section  5.2  in  terms  of  certain  postulate  sets. 

5.4.2  Find  which  of  the  following  pairs  of  transformations  of  the  points 
of  a  plane  are  commutative  with  respect  to  multiplication: 

(a)  two  translations; 

(b) two rotations about the same fixed point O;

(c)  a  rotation  about  a  fixed  point  A  and  another  rotation  about  a 
different  fixed  point  B; 

(d)  a  rotation  about  a  fixed  point  A  and  a  reflection  in  a  line  m  passing 
through  A; 

(e)  a  rotation  about  a  fixed  point  A  and  a  reflection  in  a  line  m  not 
passing  through  A; 

(f) a translation and a reflection in a line m parallel to the direction
of the translation;

(g) a translation and a reflection in a line m not parallel to the direction
of the translation.

5.4.3 Prove that the inverse of the product of two nonsingular transformations
is the product of the inverses of the transformations taken in reverse
order; that is, show that $(TS)^{-1} = S^{-1}T^{-1}$. Extend this to the product of
any number of nonsingular transformations.

5.4.4 A transformation T such that TT = I, the identical transformation,
is called an involutoric transformation.

(a) If T is involutoric, show that $T^{-1} = T$.

(b)  Give  at  least  three  examples  of  involutoric  transformations  of  the 
points  of  the  plane. 

5.4.5 If S and T are two nonsingular transformations, then the transformation
$S' = TST^{-1}$ is called the transform of S by T.

(a) Show that if R' and S' are the transforms of R and S, respectively,
by T, then R'S' is the transform of RS by T.

(b) Show that the transform by T of the inverse of S is the inverse of
the transform by T of S.

(c) Show that if the product of two nonsingular transformations is
commutative then each is its own transform by the other.

(d) If S is a rotation of the plane about a fixed point A and if T is a
translation of the plane, find the transform of S by T and the transform
of T by S.

(e) Let $G_1$ be a subgroup of a transformation group G. If every transformation
of $G_1$ is replaced by its transform by T, where T is a fixed transformation
of G, show that the transformations thus found form a subgroup
of G.

5.4.6 Although the set of all transformations of the form

x' = x cos θ − y sin θ,

y' = x sin θ + y cos θ,

constitutes a group of transformations (see Problem 5.3.3), show that the
set of all transformations of the form

x' = x cos θ + y sin θ,

y' = x sin θ − y cos θ,

does not constitute a group of transformations.

5.4.7 Show that the set of transformations of the form

$$x' = \frac{ax + b}{cx + d},$$

where a, b, c, d are real and ad − bc = 1, constitutes a group of
transformations.

5.4.8 Show that the set of transformations of the form

x' = kx + c,

y' = y/k + d,

where  c  and  d  are  real  numbers  and  k  is  a  positive  real  number,  constitutes 
a  group  of  transformations.  The  study  of  the  properties  of  the  points  of  a 
plane  which  remain  invariant  under  this  group  of  transformations  is  known 
as  a  (special)  Lorentz  geometry.  Lorentz  geometries  are  of  importance  in  the 
theory  of  relativity. 

5.4.9 All rigid motions of a plane (rotations, translations, and their
compounds) constitute a group of transformations of the set of points of a plane
into itself. It can be shown that the analytical representation of a general
rigid motion is

x' = x cos θ − y sin θ + a,
y' = x sin θ + y cos θ + b,

where θ, a, b are arbitrary real numbers. Establish, analytically, the following
invariants under the group of rigid motions of the plane:

(a) $(x_1 - x_2)^2 + (y_1 - y_2)^2$, as an invariant of the two points $(x_1,y_1)$,
$(x_2,y_2)$. What is the geometrical significance of this invariant?

(b) $(a_1b_2 - a_2b_1)/(a_1b_1 + a_2b_2)$, as an invariant of the two lines
$a_1x + a_2y + a_3 = 0$, $b_1x + b_2y + b_3 = 0$. What is the geometrical significance of
this invariant?

(c) $(a_1x_0 + a_2y_0 + a_3)/\sqrt{a_1^2 + a_2^2}$, as an invariant of the point $(x_0,y_0)$
and the line $a_1x + a_2y + a_3 = 0$. What is the geometrical significance of
this invariant?

(d) $x_0^2 + y_0^2 + a_1x_0 + a_2y_0 + a_3$, as an invariant of the point $(x_0,y_0)$
and the circle $x^2 + y^2 + a_1x + a_2y + a_3 = 0$. What is the geometrical
significance of this invariant?

(e)  Express  a  rigid  motion  analytically  as  the  product  of  a  rotation 
about  the  origin  and  a  translation. 
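
What it means for a quantity to be invariant under a rigid motion can be seen numerically before it is established analytically. The sketch below is an illustration only: the helpers `rigid_motion` and `squared_distance` and the sample values of θ, a, b and of the two points are hypothetical choices, and equality is tested with a tolerance because of floating-point rounding.

```python
import math

Point = tuple[float, float]


def rigid_motion(theta: float, a: float, b: float):
    """Return the map (x, y) -> (x cos t - y sin t + a, x sin t + y cos t + b)."""
    def move(point: Point) -> Point:
        x, y = point
        return (x * math.cos(theta) - y * math.sin(theta) + a,
                x * math.sin(theta) + y * math.cos(theta) + b)
    return move


def squared_distance(p: Point, q: Point) -> float:
    """The expression (x1 - x2)^2 + (y1 - y2)^2 for two points."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2


move = rigid_motion(theta=0.7, a=3.0, b=-2.0)
p, q = (1.0, 2.0), (-4.0, 0.5)

before = squared_distance(p, q)
after = squared_distance(move(p), move(q))

print(math.isclose(before, after, rel_tol=1e-9))
```

The analytical proof replaces the sample values by arbitrary θ, a, b and shows that the expression is unchanged identically.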

5.4.10  Consider  the  following  nonsingular  transformations  of  the  plane: 

R :  a  clockwise  rotation  of  90°  about  the  origin, 

R':  a  clockwise  rotation  of  180°  about  the  origin, 

R":  a  clockwise  rotation  of  270°  about  the  origin, 

H:  a  reflection  in  the  x-axis, 

V: a reflection in the y-axis,

D: a reflection in the line y = x,

D': a reflection in the line y = −x,

I: the identity motion, in which all points are left unmoved.

(a)  Show  that  the  eight  transformations  constitute  a  finite  non-Abelian 
transformation  group. 

(b)  Give  the  inverse  of  each  of  the  eight  transformations. 

(c) Imagine a material square having its center at the origin and its
sides parallel to the coordinate axes. Show that each of the above eight
transformations carries this square into itself. The group of transformations
is known as the group of symmetries of a square.

5.4.11 Show that any nonempty set T of nonsingular transformations of a
set S into itself for which $T_1T_2^{-1}$ is an element of T whenever $T_1$ and $T_2$ are
elements of T constitutes a transformation group under transformation
multiplication.

5.5.1  Give  an  example  of  a  tetradic  relation  and  an  example  of  a  pentadic 
relation. 

5.5.2  Classify  the  following  dyadic  relations  as  to  types  of  reflexiveness, 
symmetry,  and  transitivity: 

(a)  “Was  born  in  the  same  town  as,”  applied  to  people. 

(b)  “Is  the  spouse  of,”  applied  to  people. 

(c)  “Is  the  husband  of,”  applied  to  people. 

(d)  “Is  a  grandfather  of,”  applied  to  people. 

(e)  “Lives  within  a  mile  of,”  applied  to  people. 

(f)  “Disagrees  with,”  applied  to  people. 

(g)  “Is  east  of,”  applied  to  places  in  America. 

(h)  “Is  east  of,”  applied  to  places  anywhere  on  the  Earth  except  the  poles. 

(i)  “Is  less  than,”  applied  to  the  positive  integers. 

(j)  “Is  a  multiple  of,”  applied  to  the  positive  integers. 

(k)  “Is  a  factor  of,”  applied  to  the  positive  integers. 

(l)  “Is  relatively  prime  to,”  applied  to  the  positive  integers. 

(m)  “Is  not  equal  to,”  applied  to  the  positive  integers. 

(n)  “Is  perpendicular  to,”  applied  to  lines  in  a  plane. 

(o)  “Is  parallel  to,”  applied  to  lines  in  a  plane. 

(p) “Has the same length as,” applied to line segments.

(q)  “Is  skew  to,”  applied  to  lines  in  space. 

(r)  “Is  perpendicular  to,”  applied  to  planes  in  space. 

(s)  “Is  tangent  to,”  applied  to  spheres  in  space. 

(t)  “Is  the  complement  of,”  applied  to  angles. 

(u)  “Is  consistent  with,”  applied  to  propositions. 

(v)  “Contradicts,”  applied  to  propositions. 

(w)  “Is  implied  by,”  applied  to  propositions. 

5.5.3  Try  to  construct  examples  illustrating  each  of  the  27  possible  kinds 
of  dyadic  relations  so  far  as  types  of  reflexiveness,  symmetry,  and  transitivity 
are  concerned. 

5.5.4  Which  of  the  relations  listed  in  Problem  5.5.2  are  equivalence 
relations? 

5.5.5  What  is  wrong  with  the  following  argument  showing  that  a  relation 
which  is  both  symmetric  and  transitive  is  necessarily  reflexive? 

By symmetry, a R b implies b R a; and by transitivity, a R b and b R a
imply a R a.

5.5.6 Let S be the set of all ordered pairs of positive integers and define
(a,b) = (c,d) if and only if a + d = b + c. Show that this definition of
equality is an equivalence relation.

5.5.7 Let S be the set of all ordered pairs of integers and define (a,b) = (c,d)
if and only if ad = bc. Show that this definition of equality is an equivalence
relation.

5.5.8 Let S be a set of elements a, b, c, ... in which an equivalence relation
~ and a binary operation * are defined. We say that the operation * is
well defined relative to the equivalence relation ~ if and only if for a' ~ a
and b' ~ b we always have a' * b' ~ a * b.

(a) Let S be the set of all ordered pairs of positive integers, and define
(a,b) + (c,d) to be (a + c, b + d) and (a,b) · (c,d) to be (ac + bd, ad + bc).
Show that the operations + and · are well defined relative to the equivalence
relation of Problem 5.5.6.

(b) Let S be the set of all ordered pairs of integers, and define (a,b) +
(c,d) to be (ad + bc, bd) and (a,b) · (c,d) to be (ac,bd). Show that the
operations + and · are well defined relative to the equivalence relation of
Problem 5.5.7.

(c) Let S be the set of all ordered pairs of integers and define (a,b) * (c,d)
to be (a,c). Show that this operation is not well defined relative to the
equivalence relation of Problem 5.5.7. It follows that one cannot assume
that a given binary operation is well defined relative to a given equivalence
relation; proof of such is necessary.

The = sign in the postulates for a field (see Section 5.1) may be replaced
by any equivalence relation provided the operations ⊕ and ⊗ are well
defined relative to it.
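
The definition of a well-defined operation can be tested mechanically on a finite sample. The sketch below is illustrative only and rests on assumptions not found in the problem: the relation `equivalent` is taken to be "same remainder on division by 4" on nonnegative integers, and the function name `well_defined_on` is hypothetical. A True result on a sample is evidence; a False result is a genuine counterexample.

```python
from itertools import product
from typing import Callable, Iterable


def equivalent(a: int, b: int) -> bool:
    """Illustrative equivalence relation: same remainder on division by 4."""
    return a % 4 == b % 4


def well_defined_on(op: Callable[[int, int], int],
                    sample: Iterable[int]) -> bool:
    """True when a' ~ a and b' ~ b always give a' * b' ~ a * b on the sample."""
    values = list(sample)
    return all(
        equivalent(op(a2, b2), op(a, b))
        for a, a2, b, b2 in product(values, repeat=4)
        if equivalent(a2, a) and equivalent(b2, b)
    )


sample = range(12)
print(well_defined_on(lambda a, b: a + b, sample))   # addition respects the relation
print(well_defined_on(lambda a, b: a ** b, sample))  # 2**1 = 2 but 2**5 = 32
```

The second call fails because 1 and 5 are equivalent while 2 and 32 are not, which is exactly the kind of failure the definition is meant to exclude.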

5.5.9 Let S be the set of all nonnegative integers, and consider the dyadic
relation “a has the same remainder as b when divided by 4.”

(a)  Show  that  the  dyadic  relation  is  an  equivalence  relation. 

(b)  Find  the  class  decomposition  of  S  corresponding  to  this  equivalence 
relation. 
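
For a finite set, the class decomposition corresponding to an equivalence relation can be computed directly. The following is a minimal sketch under stated assumptions: the function `class_decomposition` is a hypothetical helper, it relies on the relation really being an equivalence relation (so that comparing with one representative of each class suffices), and the demonstration uses words grouped by length rather than the relation of this problem.

```python
from typing import Callable, Iterable, TypeVar

T = TypeVar("T")


def class_decomposition(elements: Iterable[T],
                        related: Callable[[T, T], bool]) -> list[list[T]]:
    """Split the elements into classes of mutually related members."""
    classes: list[list[T]] = []
    for x in elements:
        for cls in classes:
            if related(cls[0], x):  # compare with one representative only
                cls.append(x)
                break
        else:
            classes.append([x])     # x starts a new class
    return classes


def same_length(u: str, v: str) -> bool:
    return len(u) == len(v)


words = ["set", "group", "ring", "field", "loop", "map"]
print(class_decomposition(words, same_length))
```

Every element lands in exactly one class, which is the finite counterpart of the uniqueness and nonoverlapping properties taken up in Problem 5.5.10.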

5.5.10  Establish  the  uniqueness  and  the  nonoverlapping  of  the  class 
decomposition  of  a  set  S  corresponding  to  an  equivalence  relation  ~  defined 
in  S  by  proving  the  following  sequence  of  theorems: 

Definition: If x is an element of S, let S(x) denote the class of all
elements y of S such that x ~ y.

Theorem 1. If x is an element of S, then x is an element of S(x).

Theorem  2.  Each  element  x  of  S  is  in  some  S(x). 

Theorem  3.  If  x  ~  y ,  then  the  classes  S(x)  and  S(y)  are  identical. 

Theorem  4.  A  class  S(x)  is  completely  determined  by  any  one  of  its 
elements. 

Theorem 5. If two classes S(x) and S(y) have a common element then
the two classes are identical.

Theorem  6.  The  classes  S(x)  constitute  a  unique  decomposition  of  S 
into  nonoverlapping  classes. 

5.5.11  Consider  the  following  definitions: 

Definition 1. A dyadic relation R is said to be determinate (or connected)
in a set S of elements if and only if for any two distinct elements a and b
of S, either a R b or b R a.

Definition 2. A dyadic relation R is said to be antisymmetric in a set S
of elements if and only if whenever a R b and b R a, then a is identical with b.

Definition 3. A set of elements S is said to be simply, or linearly, ordered
with respect to a dyadic relation R if and only if the following postulates hold:

O1: R is determinate in S.

O2: R is irreflexive in S.

O3: R is transitive in S.

Definition 4. A set of elements S is said to be partially ordered with
respect to a dyadic relation R if and only if the following postulates hold:

Q1: R is reflexive in S.

Q2: R is antisymmetric in S.

Q3: R is transitive in S.

Definition 5. If a and b are elements of S and if a R b, then we write
b R' a.

(a)  Which  of  the  relations  listed  in  Problem  5.5.2  are  determinate? 

(b)  Which  are  antisymmetric? 

(c)  Which  induce  a  simple  ordering? 

(d)  Which  induce  a  partial  ordering? 

(e)  If  R  is  the  relation  “is  less  than,”  applied  to  the  positive  integers, 
what  is  the  relation  R'?  If  R  is  the  relation  “is  not  greater  than,”  applied  to 
the  positive  integers,  what  is  the  relation  R'? 

(f) Show that if S is partially ordered with respect to relation R, then
it is also partially ordered with respect to relation R'. (This is known as
the principle of duality for partially ordered sets; for any theorem about the
relation R there is a corresponding theorem about the relation R'.)

5.5.12 A dyadic relation R is said to be distinctly transitive in a set S of
elements if for any three distinct elements x, y, z of S, x R y and y R z together
imply x R z. Give an example of a set S and a dyadic relation R such that
R is distinctly transitive in S but is not transitive in S.


CHAPTER SIX

FORMAL AXIOMATICS


6.1 Statement of the Modern Axiomatic Method*

The  discovery  of  a  non-Euclidean  geometry  and,  not  long  after,  of  a 
noncommutative  algebra  led  to  a  deeper  study  and  refinement  of  axiomatic 
procedure;  thus,  from  the  material  axiomatics  of  the  ancient  Greeks  evolved 
the  formal  axiomatics  of  the  twentieth  century.  To  help  clarify  the  difference 
between  the  two  forms  of  axiomatics,  we  shall  first  introduce  the  modern 
concept  of  propositional  function ,  the  fundamental  importance  of  which  was 
first  brought  to  notice  by  the  English  mathematician  and  philosopher 
Bertrand  Russell  (1872-  ). 

Consider  the  three  statements:  (1)  Spring  is  a  season.  (2)  8  is  a  prime 
number. (3) x is a y. Each of these statements has form, indeed the same form;
statements  (1)  and  (2)  have  content  as  well  as  form;  statement  (3)  has  form 
only.  Clearly,  statements  (1)  and  (2)  are  propositions,  one  true  and  the  other 
false.  Equally  clearly,  statement  (3)  is  not  a  proposition,  for,  since  it  asserts 
nothing  definite,  it  is  neither  true  nor  false,  and  a  proposition,  by  definition, 

*This section is based largely upon the opening essay, “The Meaning of Mathematics,”
in C. J. Keyser [4] (Bibliography).

is a statement which is true or false. Statement (3), however, though not a
proposition,  does  have  the  form  of  a  proposition.  It  has  been  called  a 
propositional  function ,  for  if  in  the  form 

x is a y,

we  substitute  terms  of  definite  meaning  for  the  variables  x  and  y,  we  may 
obtain  propositions,  true  propositions  if  the  substituted  terms  verify  the 
propositional  function,  false  propositions  if  the  substituted  terms  falsify  the 
propositional  function.  It  is  apparent  that  some  substitutions  for  x  and  y 
convert  the  propositional  function  into  so  much  nonsense;  such  meanings 
for  the  variables  are  considered  inadmissible.  The  form  considered  above  is  a 
propositional  function  in  two  variables,  and  it  has  infinitely  many  verifiers. 

A  propositional  function  may  contain  any  number  of  variables.  An 
example  having  but  one  is  this:  x  is  a  volume  in  the  Library  of  Congress. 
Here  x  evidently  has  as  many  verifying  values  as  there  are  volumes  in  the 
Library  of  Congress.  Evidently,  too,  the  variable  x  has  many  falsifying  values. 
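The idea of a propositional function can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names SEASONS, KINDS, is_prime, and is_a are hypothetical and do not come from the text, and the small table of admissible meanings for y is invented for the example. Substituting definite terms for x and y yields a proposition with a truth value; a meaning for y outside the table is treated as inadmissible.

```python
# A toy model of the propositional function "x is a y".
# All names below are hypothetical and chosen only for illustration.

SEASONS = {"Spring", "Summer", "Autumn", "Winter"}


def is_prime(n):
    """Return True when n is a prime number."""
    return isinstance(n, int) and n > 1 and all(
        n % d for d in range(2, int(n ** 0.5) + 1)
    )


# Admissible meanings for the variable y, each paired with a test for x.
KINDS = {
    "season": lambda x: x in SEASONS,
    "prime number": is_prime,
}


def is_a(x, y):
    """The form 'x is a y'; it has no truth value until x and y are supplied."""
    if y not in KINDS:
        raise ValueError(f"inadmissible meaning for y: {y!r}")
    return KINDS[y](x)


statement_1 = is_a("Spring", "season")    # "Spring is a season."
statement_2 = is_a(8, "prime number")     # "8 is a prime number."
```

Here the pair ("Spring", "season") verifies the function and the pair (8, "prime number") falsifies it, matching statements (1) and (2) of the text.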

There is no need for the variables in a propositional function to be
denoted by symbols, such as x, y, ...; they may be ordinary words. Thus,
should a statement whose terms are ordinary words appear in a discourse
with no indication as to the senses in which the words are to be understood,
then in that discourse the statement is really a propositional function, rather
than a proposition, and in the interests of clarity the ambiguous or undefined
terms might better be replaced by such symbols as x, y, .... Written and
spoken discourse often contains such statements, which, though asserted by
their authors to be propositions, true or false, are in reality propositional
functions, devoid of all true or false quality. As Keyser has observed,* this
fact perhaps accounts for much of the argument and misunderstanding
among people.

With  the  idea  of  a  propositional  function  firmly  in  mind,  let  us  return 
to  a  discussion  of  axiomatic  procedure.  We  recall  that  any  logical  discourse, 
in  an  endeavor  to  be  clear,  tries  to  define  explicitly  all  elements  of  the 
discourse,  the  relations  among  these  elements,  and  the  operations  to  be 
performed upon them. Such definitions, however, must employ other
elements, relations, and operations, and these, too, are subject to explicit
definition.  If  these  are  defined,  it  must  again  be  by  reference  to  further 
elements,  relations,  and  operations.  There  are  two  roads  open  to  us;  either 
the  chain  of  definitions  must  be  cut  short  at  some  point,  or  else  it  must  be 
circular.  Since  circularity  is  not  to  be  tolerated  in  a  logical  discourse,  the 
definitions  must  be  brought  to  a  close  at  some  point;  thus  it  is  necessary 
that  one  or  more  elements,  relations,  and  operations  receive  no  explicit 
definition.  These  are  known  as  the  primitive  terms  of  the  discourse.  There  is 
likewise  an  effort  logically  to  deduce  the  statements  of  the  discourse,  and, 


*Loc.  cit. 
again, in order to get started and also to avoid the vicious circle, one or
more  of  the  statements  must  remain  entirely  unproved.  These  are  known  as 
the  postulates  (or  axioms,  or  primary  statements)  of  the  discourse.  Clearly, 
then,  any  logical  discourse  such  as  we  are  considering  must  conform  to  the 
following  pattern: 

Pattern  of  formal  axiomatics 

1. The discourse contains a set of technical terms (elements, relations
among  the  elements,  operations  to  be  performed  on  the  elements)  which 
are  deliberately  chosen  as  undefined  terms.  These  are  the  primitive  terms 
of  the  discourse. 

2.  All  other  technical  terms  of  the  discourse  are  defined  by  means  of  the 
primitive  terms. 

3. The discourse contains a set of statements about the primitive terms
which are deliberately chosen as unproved statements. These are called
the postulates, or primary statements, P, of the discourse.

4. All other statements (about the primitive and the defined terms) of the
discourse are logically deduced from the postulates. These derived
statements are called the theorems, T, of the discourse.

5. For each theorem T_i of the discourse there exists a corresponding statement
(which may or may not be formally expressed) asserting that
theorem T_i is logically implied by the postulates P. (Often a corresponding
statement appears at the end of the proof of the theorem in
some such words as, “Hence the theorem,” or “This completes the
proof of the theorem”; in some elementary geometry textbooks the
statement appears, at the end of the proof of the theorem, as “Q.E.D.”
(Quod erat demonstrandum). The modern symbol ■, suggested by
Paul R. Halmos, is frequently used to signalize the end of a proof.)

Note in the above pattern that the primitive terms, being undefined
terms, might just as well (if such is not already the case) be replaced by
symbols like x, y, .... Let us suppose this substitution is made. Then the
primitive terms are clearly variables. Second, note that the postulates, P,
since they are statements about the primitive terms, are nothing less than
propositional functions. Third, note that the theorems, T, since they are
but logical implications of the postulates P, also are propositional functions.
We  are  thus  brought  to  a  fact  of  cardinal  importance,  namely,  that  once 
the  primitive  terms  are  realized  to  be  variables,  both  the  postulates  and  the 
theorems  of  a  logical  discourse  are  not  propositions  but  propositional 
functions. 

Since the postulates and the theorems of a logical discourse are propositional
functions, that is, are statements of form only and without content,
it would seem that the whole discourse is somewhat vacuous and entirely
devoid of truth or falseness. Such, however, is not the case, for by (5) of the
postulational  pattern  we  have  the  all-important  statement, 

(6)  The  postulates  P  imply  the  theorems  T. 

Now  (6)  asserts  something  definite;  it  is  true  or  false,  and  so  is  a  proposition 
—  a  true  one  if  the  theorems  T  are  in  fact  implied  by  the  postulates  P,  and 
a  false  one  if  they  are  not.  Statement  (6)  is  precisely  what  the  discourse  is 
designed  for;  it  is  the  discourse’s  sole  aim  and  reason  for  being. 

A  discourse  conducted  according  to  the  above  pattern  has  been  called, 
by  some  mathematicians,  a  branch  of  pure  mathematics,  and  the  grand  total 
of  all  such  existing  branches  of  pure  mathematics,  the  pure  mathematics  up 
to  date. 

If, for the variables (the primitive terms) in a branch of pure mathematics,
we should substitute terms of definite meaning that convert all the
postulates of the branch into true propositions, then the set of substituted
terms is called an interpretation of the branch of pure mathematics. The
interpretation will also, provided all deductions have been correctly performed,
convert the theorems of the discourse into true propositions. The
result of such an interpretation is called a model of the branch of pure
mathematics.

A model of a branch of pure mathematics has been called a branch of
applied mathematics, and the grand total of all existing branches of applied
mathematics, the applied mathematics up to date. Thus, according to this
definition, the difference between applied and pure mathematics is not one
of applicability and inapplicability, but rather of concreteness and abstractness.
Behind every branch of applied mathematics lies a branch of pure
mathematics,  the  latter  being  an  abstract  development  of  what  formerly  was 
a  concrete  development.  It  is  conceivable  (and  indeed  such  is  often  the  case) 
that  a  single  branch  of  pure  mathematics  may  have  several  models,  or 
associated  branches  of  applied  mathematics.  This  is  the  “economy”  feature 
of  pure  mathematics,  for  the  establishment  of  a  branch  of  pure  mathematics 
automatically  assures  the  simultaneous  establishment  of  all  of  its  branches 
of  applied  mathematics. 

The  abstract  development  of  some  branch  of  pure  mathematics  is  an 
instance of formal axiomatics, whereas the concrete development of a given
branch  of  applied  mathematics  is  an  instance  of  material  axiomatics.  In  the 
former  case  we  think  of  the  postulates  as  prior  to  any  specification  of  the 
primitive  terms,  and  in  the  latter  we  think  of  the  objects  and  concepts  that 
interpret  the  primitive  terms  as  being  prior  to  the  postulates.  In  the  former 
case  a  postulate  is  simply  a  basic  assumption  about  some  undefined  primitive 
terms;  in  the  latter  case  a  postulate  expresses  some  property  of  the  basic 
objects  and  concepts  that  is  taken  as  initially  evident.  This  latter  is  the  older 
view  of  a  postulate,  and  was  the  view  held  by  some  of  the  ancient  Greeks. 
Thus,  to  these  Greeks,  geometry  was  thought  of  as  a  study  dealing  with  a 
unique structure of physical space, in which the elements points and lines
are  regarded  as  idealizations  of  certain  actual  physical  entities,  and  in  which 
the  postulates  are  readily  accepted  statements  about  these  idealizations. 
From  the  modern  point  of  view,  geometry  is  a  purely  abstract  study  devoid 
of  any  physical  meaning  or  imagery. 

The  notion  of  mathematics  as  an  assemblage  of  abstract  postulational 
discourses  gives  considerable  sense  to  Bertrand  Russell’s  facetious  statement 
that  “mathematics  may  be  defined  as  the  subject  in  which  we  never  know 
what we are talking about, nor whether what we are saying is true.”* It
also accords with Henri Poincaré’s saying that mathematics is “the giving
of the same name to different things,”† and with Benjamin Peirce’s (1809-1880)
remark that “mathematics is the science which draws necessary
conclusions.”‡

6.2 A Simple Example of a Branch
of Pure Mathematics

We  propose  to  give  in  this  section  a  simple  example  of  a  branch  of  pure 
mathematics  followed  by  three  applications  of  that  branch.  In  other  words, 
we  shall  develop  a  formal  axiomatic  discourse  and  then,  by  appropriate 
interpretations, obtain three models of it. To this end consider a set K of
undefined elements a, b, c, ..., and an undefined dyadic relation R connecting
certain pairs of elements of K. If element a is related to element b
by the R relation, we shall write a R b and read “a is R-related to b”; if
element a is not related to element b by the R relation, we shall write a R̸ b
and read “a is not R-related to b.” If elements a and b are identical (the
same element), we shall write a = b; if elements a and b are distinct, we
shall write a ≠ b. We now list four unproved statements, or postulates,
concerning the elements of set K and the dyadic relation R.

P1: If a ≠ b, then either a R b or b R a.

P2: If a R b, then a ≠ b.

P3:  If  a  R  b  and  b  R  c,  then  a  R  c. 

P4:  K  consists  of  exactly  four  distinct  elements. 

From  these  postulates  we  shall  now  logically  deduce  further  statements, 
or theorems. Theorems will be designated by T1, T2, ... and definitions by
D1, D2, ....

T1: If a R b, then b R̸ a.

Suppose both a R b and b R a. Then, by P3, a R a. But this statement
is impossible by P2. Hence the theorem by reductio ad absurdum.

*B. Russell [1].

†Quoted in C. J. Keyser [1], p. 134.

‡B. Peirce.

T2: If a R b, and c is in K, then either a R c or c R b.

If c = a, then c R b, and we are done. If c ≠ a, we have, by P1, either
a R c or c R a. If c R a, since also a R b, we have, by P3, c R b. Hence the
theorem. 

T3: There is at least one element of K not R-related to any element of K.

Suppose the contrary case and let a be any element of K. Then, by
our supposition, there exists an element b of K such that a R b. By P2,
a and b are distinct elements of K.

By our supposition there exists an element c of K such that b R c.
By P2, b ≠ c. By P3, we also have a R c. By P2, a ≠ c. Thus a, b, c are
distinct elements of K.

By our supposition there exists an element d of K such that c R d.
By P2, c ≠ d. By P3, we also have b R d and a R d. By P2, b ≠ d, a ≠ d.
Thus a, b, c, d are distinct elements of K.

By our supposition there exists an element e of K such that d R e. By
P2, d ≠ e. By P3, we also have c R e, b R e, a R e. By P2, c ≠ e, b ≠ e,
a ≠ e. Thus a, b, c, d, e are distinct elements of K.

We  now  have  a  contradiction  of  P4.  Hence  the  theorem  by  reductio 
ad  absurdum. 

T4: There is only one element of K not R-related to any element of K.

By T3, there is at least one such element, say a. Let b ≠ a be any other
element  of  K.  By  PI,  either  a  R  b  or  b  R  a.  But,  by  hypothesis,  we  do  not 
have  a  R  b.  Therefore  we  must  have  b  R  a,  and  the  theorem  is  proved. 

D1: If b R a, then we say a D b.

T5: If a D b and b D c, then a D c.

By D1, b R a and c R b. By P3, we then have c R a, or, by D1, a D c.

D2: If a R b and there is no element c of K such that a R c and c R b,
then we say a F b.

T6: If a F c and b F c, then a = b.

Suppose a ≠ b. Then, by P1, either a R b or b R a.

Case 1. Suppose a R b. Since b F c, by D2, b R c. But this is impossible,
since a F c.

Case 2. Suppose b R a. Since a F c, by D2, a R c. But this is impossible,
since b F c.

Thus in either case we are led to a contradiction of our hypothesis.
Hence the theorem by reductio ad absurdum.

T7: If a F b and b F c, then a F̸ c.

By D2, a R b and b R c. Hence, again by D2, a F̸ c.

D3:  If  a  F  b  and  b  F  c,  then  we  say  a  G  c. 

We  shall  cut  short  our  abstract  postulational  discourse  at  this  point. 
There  are,  of  course,  many  other  theorems  which  could  be  established  in 
the system, but probably we have considered enough to illustrate the notion
of  a  branch  of  pure  mathematics.  It  is  worth  noting  that  a  number  of  our 
postulates  and  theorems  can  be  more  rhetorically  and  more  tersely  worded 
if  we  use  some  of  the  theory  of  relations  developed  in  Section  5.5  and  its 
associated problems. Thus P1, P2, P3, T1, T5, and T7 can be restated as

P1: R is determinate in K.

P2: R is irreflexive in K.

P3: R is transitive in K.

T1: R is asymmetric in K.

T5:  D  is  transitive  in  K. 

T7:  F  is  intransitive  in  K. 

Theorem  T3  is  an  example  of  an  existence  theorem  and  Theorem  T4  is  an 
example  of  a  uniqueness  theorem.  Such  theorems  are  common  and  important 
in  many  bodies  of  mathematics. 
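Because K is finite, the deductions above can be cross-checked by brute force. The sketch below is an illustration only, not part of the formal discourse: it labels the four elements 0, 1, 2, 3 (an arbitrary assumption), enumerates every dyadic relation on K, keeps those satisfying P1, P2, P3, and tests T1 through T7 in each. The function names are hypothetical. Such a check does not replace the proofs; it merely confirms that no relation on four elements contradicts them.

```python
from itertools import product

K = range(4)  # P4: exactly four elements; the labels 0..3 are placeholders
PAIRS = [(a, b) for a in K for b in K]


def is_transitive(rel):
    """True when a rel b and b rel c always give a rel c."""
    return all((a, c) in rel for (a, b) in rel for (b2, c) in rel if b == b2)


def satisfies_postulates(R):
    """Check P2, P1, P3 in that order (cheapest test first)."""
    if any(a == b for (a, b) in R):                        # P2: irreflexive
        return False
    if not all((a, b) in R or (b, a) in R
               for a in K for b in K if a != b):           # P1: determinate
        return False
    return is_transitive(R)                                # P3: transitive


def theorems_hold(R):
    """Check T1 through T7 for one relation R."""
    D = {(a, b) for (b, a) in R}                           # D1: a D b iff b R a
    F = {(a, b) for (a, b) in R
         if not any((a, c) in R and (c, b) in R for c in K)}   # D2
    unrelated = [a for a in K if not any((a, b) in R for b in K)]
    return (
        all((b, a) not in R for (a, b) in R)                            # T1
        and all((a, c) in R or (c, b) in R
                for (a, b) in R for c in K)                             # T2
        and len(unrelated) == 1                                         # T3, T4
        and is_transitive(D)                                            # T5
        and all(a == b for (a, c) in F for (b, c2) in F if c == c2)     # T6
        and all((a, c) not in F
                for (a, b) in F for (b2, c) in F if b == b2)            # T7
    )


# Every subset of K x K is a candidate relation (2**16 of them).
models = []
for bits in product([False, True], repeat=len(PAIRS)):
    R = {pair for pair, bit in zip(PAIRS, bits) if bit}
    if satisfies_postulates(R):
        models.append(R)

all_theorems_ok = all(theorems_hold(R) for R in models)
```

The relations that survive the filter are the strict linear orderings of the four labels, one for each arrangement of the elements in a row.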

We  shall  now  give  three  applications  of  our  branch  of  pure  mathematics, 
thus  obtaining  three  derived  branches  of  applied  mathematics. 

Application  1  (Genealogical) 

Let  the  elements  of  K  be  four  men— some  man,  his  father,  his  father’s 
father,  and  his  father’s  father’s  father,  and  let  R  mean  “is  an  ancestor  of.” 

We  readily  see  that  this  interpretation  of  the  elements  of  K  and  of  the 
relation  R  converts  the  postulates  into  true  propositions.  We  are  thus  led 
to  a  concrete  model  of  our  abstract  postulational  discourse,  that  is,  to  a 
branch  of  applied  mathematics  derived  from  our  branch  of  pure  mathematics. 
The  theorems,  which  now  must  all  become  true  propositions,  and  the 
definitions  read: 

T1(1): If a is an ancestor of b, then b is not an ancestor of a.

T2(1): If a is an ancestor of b, and if c is one of the four men, then
either a is an ancestor of c or c is an ancestor of b.

T3(1): There is at least one man in K who is not an ancestor of anyone
in K.

T4(1): There is only one man in K who is not an ancestor of anyone
in K.

D1(1): If b is an ancestor of a, we say that a is a descendant of b.

T5(1): If a is a descendant of b and b is a descendant of c, then a is a
descendant of c.

D2(1): If a is an ancestor of b and there is no individual c such that a is
an ancestor of c and c is an ancestor of b, we say that a is a father of b.

T6(1): A man has at most one father.

T7(1): If a is the father of b and b is the father of c, then a is not the
father of c.

D3(1): If a is the father of b and b is the father of c, we say that a is a
grandfather of c.

Application 2 (Geometrical)

Let  the  elements  of  K  be  four  distinct  points  on  a  horizontal  straight 
line,  and  let  R  mean  “is  to  the  left  of.” 

Again our postulates are satisfied and we have a second branch of
applied  mathematics  derived  from  our  branch  of  pure  mathematics.  The 
relation  D  means  “is  to  the  right  of,”  the  relation  F  means  “is  the  next 
point  of  K  to  the  left  of,”  and  relation  G  means  “is  the  second  point  of  K 
to  the  left  of.” 

Application  3  (Arithmetical) 

Let  the  elements  of  K  be  the  four  integers  1,  2,  3,  4,  and  let  R  mean 
“is  less  than.” 

Once  again  our  postulates  are  satisfied  and  we  have  a  third  branch  of 
applied  mathematics  derived  from  our  branch  of  pure  mathematics.  Here 
relation  D  means  “is  greater  than,”  relation  F  means  “is  1  less  than,”  and 
relation  G  means  “is  2  less  than.” 
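The arithmetical interpretation is small enough to verify mechanically. The following sketch, offered as an illustration with hypothetical variable names, takes K = {1, 2, 3, 4} with R read as "is less than," checks the four postulates, builds D, F, and G directly from definitions D1, D2, and D3, and compares them with the readings stated above.

```python
K = [1, 2, 3, 4]
R = {(a, b) for a in K for b in K if a < b}   # a R b means "a is less than b"

# The four postulates under this interpretation.
p1 = all((a, b) in R or (b, a) in R for a in K for b in K if a != b)
p2 = all(a != b for (a, b) in R)
p3 = all((a, c) in R for (a, b) in R for (b2, c) in R if b == b2)
p4 = len(set(K)) == 4
postulates_hold = p1 and p2 and p3 and p4

# Derived relations, built from the definitions rather than from arithmetic.
D = {(a, b) for (b, a) in R}                                  # D1
F = {(a, b) for (a, b) in R
     if not any((a, c) in R and (c, b) in R for c in K)}      # D2
G = {(a, c) for (a, b) in F for (b2, c) in F if b == b2}      # D3

# Compare with the readings given in the text.
d_is_greater_than = D == {(a, b) for a in K for b in K if a > b}
f_is_one_less = F == {(a, b) for a in K for b in K if b - a == 1}
g_is_two_less = G == {(a, b) for a in K for b in K if b - a == 2}
```

Replacing K and the test `a < b` by another finite interpretation (for instance, positions of books on a shelf) reuses the same checks unchanged, which is the "economy" feature in miniature.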

Our  example  of  a  branch  of  pure  mathematics,  with  its  derived  branches 
of  applied  mathematics,  illustrates  the  “economy”  feature  of  the  modern 
axiomatic  method.  Any  theorem  of  the  branch  of  pure  mathematics  yields 
a  corresponding  theorem  in  each  of  the  applications,  and  these  latter  require 
no  proof  so  long  as  the  theorem  in  the  abstract  system  has  been  proved. 
The  abstract  postulate  set  studied  above  is  a  postulate  set  for  simple  order 
among  four  elements.  Any  relation  which  is  an  interpretation  of  the  first 
three  postulates*  is  called  a  simple  order  relation.  “Is  less  than,”  “is  greater 
than,”  “is  to  the  left  of,”  “is  to  the  right  of,”  “is  before,”  “is  after,”  are  all 
simple  order  relations. 

6.3  Properties  of  Postulate  Sets  — 

Equivalence  and  Consistency 

It must not be thought, in building up an abstract postulational discourse,
that we may set down a collection of symbols for undefined terms
and  then  list  for  postulates  an  arbitrary  system  of  assumed  statements  about 
these  terms.  There  are  certain  required  and  certain  desired  properties  which 
our  system  of  assumed  statements— our  postulates— should  possess.  This 

*These postulates were first studied by E. V. Huntington in 1905. See E. V.
Huntington [9].

section and the following section will accordingly be devoted to a brief
examination of some of the properties of postulate sets. Such a study is
technically known as metamathematics, and was first brought into prominence
by Hilbert’s Grundlagen der Geometrie. Of the properties of postulate sets, we
shall consider the four known as equivalence, consistency, independence, and
completeness.  The  first  property  applies  to  pairs  of  postulate  sets,  and  the 
remaining  three  apply  to  individual  postulate  sets. 

Equivalence.  Two  postulate  systems  P(1)  and  P(2)  are  said  to  be  equivalent 
if  each  system  implies  the  other,  that  is,  if  the  primitive  terms  in  each  are 
definable  by  means  of  the  primitive  terms  of  the  other,  and  if  the  postulates 
of  each  are  deducible  from  the  postulates  of  the  other.  If  two  postulate 
systems  are  equivalent,  then  the  two  abstract  studies  implied  by  them  are, 
of course, the same, and it is merely a matter of “saying the same thing in
different  ways.”  The  idea  of  equivalent  postulate  systems  arose  in  ancient 
times  when  geometers,  dissatisfied  with  Euclid’s  parallel  postulate,  tried 
to  substitute  for  it  a  more  acceptable  equivalent.  The  modern  studies  of 
Euclidean  geometry,  with  their  various  and  quite  different  postulational 
bases,  clearly  illustrate  that  a  postulate  system  is  by  no  means  uniquely 
determined  by  the  study  in  question,  but  depends  upon  which  technical 
terms  of  the  study  are  chosen  as  primitive  and  which  statements  of  the  study 
are  taken  as  unproved. 

Since  a  given  study  may  be  built  upon  more  than  one  possible  postulate 
system,  the  question  naturally  arises  regarding  how  to  determine  which  of 
two  equivalent  postulate  systems  is  the  better.  There  is  perhaps  no  simple 
criterion  for  comparing  a  pair  of  equivalent  postulate  sets  with  the  aim  of 
determining  which  set  is  the  better;  it  seems  to  be  largely  a  matter  of  personal 
preference.  On  grounds  of  economy  of  assumption,  it  might  seem  that  that 
set  which  contains  fewer  primitive  terms  and  fewer  postulates  is  the  one  to 
be  preferred.  There  is  certainly  something  satisfying  in  reducing  the  number 
of  primitive  terms  to  the  barest  minimum,  but  it  is  easy  to  see  that  the  idea 
of  reducing  the  number  of  postulates  to  a  minimum  is  somewhat  illusory, 
for  one  could,  if  one  wished,  lump  all  the  postulates  of  a  given  set  into  one 
big,  but  complicated,  postulate  by  using  a  proper  number  of  conjunctions. 
On  the  other  hand,  there  might  be  reason  to  think  a  postulate  set  better, 
because  it  would  seem  to  be  in  a  sense  simpler,  if  each  postulate  were  to  be 
broken  down  into  as  many  separate  statements  as  possible.  But  this,  too, 
has  little  significance,  as  Olaf  Helmer  amusingly  pointed  out  by  the  following 
example:*  The  simple  statement,  “There  is  one  and  only  one  x  which 
satisfies  g(x),”  can  be  replaced  by  the  five  statements: 

(1)  The  number  of  x’s  which  will  satisfy  g(x)  is  odd. 

(2)  The  number  of  x’s  which  will  satisfy  g(x)  is  less  than  8. 


*O. Helmer.

(3) The number of x’s which will satisfy g(x) is not equal to 7.

(4) The number of x’s which will satisfy g(x) is not equal to 5.

(5) The number of x’s which will satisfy g(x) is not equal to 3.

It  is  obvious,  of  course,  that  such  a  procedure  may  be  extended  indefinitely. 
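That the five statements together say no more than "exactly one" is a matter of simple arithmetic, which the short check below makes explicit. The function name and the search bound of 1000 are arbitrary choices made for illustration; statement (2) already rules out every count of 8 or more.

```python
def satisfies_all_five(n):
    """True when a count n of solutions is allowed by statements (1)-(5)."""
    return (
        n % 2 == 1      # (1) the number is odd
        and n < 8       # (2) the number is less than 8
        and n != 7      # (3) the number is not 7
        and n != 5      # (4) the number is not 5
        and n != 3      # (5) the number is not 3
    )


# Counts are nonnegative integers; collect every count the statements permit.
allowed_counts = [n for n in range(0, 1000) if satisfies_all_five(n)]
```

The only count left standing is 1, so the five statements jointly assert "one and only one."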

Finally,  one  postulate  set  may  lead  more  rapidly  than  another  to  the  key 
theorems  of  a  study  and  may  for  this  reason  be  preferred  to  the  other.  But 
if  one’s  concern  is  to  reach  certain  key  results  as  expeditiously  as  possible, 
one  can  always  include  these  key  results  right  in  the  postulate  set  and  thus 
do  away  with  all  delay  in  reaching  them. 

As a postulate set equivalent to that of the example developed in
Section 6.2 we might use P1, T1, P3, P4. Since T1 has already been derived
from P1, P2, P3, P4, it suffices to show that P2 can be derived from P1, T1,
P3, P4. This may be accomplished as follows:

P2: If a R b, then a ≠ b.

Suppose, on the contrary, that we have a R b and a = b. Then we also
have b R a. But this is impossible by T1. Hence P2 follows by reductio ad
absurdum.
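The equivalence just proved can also be illustrated by comparing models. The sketch below is an assumption-laden check and not a substitute for the deduction: on a fixed four-element set (labels 0 to 3 chosen arbitrarily, function names hypothetical) it collects the relations satisfying P1, P2, P3 and those satisfying P1, T1, P3, and compares the two collections. Agreement on this one carrier set is what equivalence predicts.

```python
from itertools import product

K = range(4)  # P4 is built in: four distinct placeholder elements
PAIRS = [(a, b) for a in K for b in K]


def p1(R):  # determinate
    return all((a, b) in R or (b, a) in R for a in K for b in K if a != b)


def p2(R):  # irreflexive
    return all(a != b for (a, b) in R)


def p3(R):  # transitive
    return all((a, c) in R for (a, b) in R for (b2, c) in R if b == b2)


def t1(R):  # asymmetric
    return all((b, a) not in R for (a, b) in R)


def models(postulates):
    """All relations on K satisfying every postulate in the given list."""
    found = set()
    for bits in product([False, True], repeat=len(PAIRS)):
        R = frozenset(pair for pair, bit in zip(PAIRS, bits) if bit)
        if all(p(R) for p in postulates):
            found.add(R)
    return found


original = models([p2, p1, p3])       # P1, P2, P3 (with P4 built in)
alternative = models([t1, p1, p3])    # P1, T1, P3 (with P4 built in)
same_models = original == alternative
```

The order of the tests inside each list affects only speed, since a relation must pass all of them to be kept.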

Consistency. A postulate set is said to be consistent if contradictory statements
are not implied by the set. This is the most important and most
fundamental  property  of  a  postulate  set;  without  this  property  the  postulate 
set  is  worthless,  and  it  is  useless  to  consider  any  further  properties  of  the  set. 

The  most  successful  method  so  far  invented  for  establishing  consistency 
of  a  postulate  set  is  the  method  of  models.  A  model  of  a  postulate  set,  it 
will  be  recalled,  is  obtained  if  we  can  assign  meanings  to  the  primitive  terms 
of  the  set  which  convert  the  postulates  into  true  statements  about  some 
concept.  There  are  two  types  of  models— concrete  models  and  ideal  models. 
A  model  is  said  to  be  concrete  if  the  meanings  assigned  to  the  primitive  terms 
are  objects  and  relations  adapted  from  the  real  world,  whereas  a  model  is 
said  to  be  ideal  if  the  meanings  assigned  to  the  primitive  terms  are  objects 
and  relations  adapted  from  some  other  postulate  system. 

Where a concrete model has been exhibited we feel that we have established
the absolute consistency of our postulate system, for if contradictory
theorems  are  implied  by  our  postulates,  then  corresponding  contradictory 
statements  would  hold  in  our  concrete  model.  But  contradictions  in  the  real 
world  we  accept  as  being  impossible.  As  an  illustration,  consider  the  postulate 
set  for  the  miniature  branch  of  mathematics  developed  in  the  preceding 
section.  In  the  genealogical  application  made  there,  we  interpreted  the  four 
primitive  elements  of  K  to  be  a  man,  his  father,  his  father’s  father,  and  his 
father’s father’s father, and we interpreted the primitive R-relation to be
“is  an  ancestor  of.”  We  noted  that  these  interpretations  convert  the  postulates 
into  true  statements  about  a  simple  genealogical  concept.  But  the  meanings 
assigned  to  the  four  objects  and  the  connecting  relation  have  been  adapted 
from the real world, and we therefore have a resulting concrete model of
our  postulate  system.  Any  inconsistency  in  our  branch  of  mathematics  would 
yield  a  corresponding  inconsistency  concerning  the  genealogical  relationship 
of  our  four  men.  But  since  the  real  world,  we  believe,  will  not  permit 
contradictions  to  exist  within  it,  we  also  believe  that  we  have  established  the 
absolute  consistency  of  our  postulate  set.  Many  other  concrete  models  of 
the  same  postulate  set  can  easily  be  given.  For  example,  we  might  let  K 
consist of four books standing side by side on a bookshelf, and let the R-relation
mean “precedes,” or we might let K consist of four people walking
in single file, and let the R-relation mean “is in front of.”

As  another  example  of  a  proof  of  absolute  consistency,  consider  the 
postulate  system  for  plane  projective  geometry  as  given  in  Section  4.5. 
A  concrete  model  of  the  finite  geometry  consisting  of  the  seven  symbolic 
points A, B, C, D, E, F, G and the seven symbolic lines a, b, c, d, e, f, g is
set up by the following table, in which an asterisk at, say, the intersection
of row C and column f indicates that point C lies on line f.


[Incidence table: rows are the points A, B, C, D, E, F, G; columns are the
lines a, b, c, d, e, f, g; an asterisk marks each incidence. The column
positions of the asterisks are not recoverable here.]

Since  the  postulates  for  plane  projective  geometry  can  be  verified  for  our 
interpretation  by  means  of  this  table,  it  follows  that  the  table  furnishes  a 
concrete  model  of  the  geometry,  and  that  the  postulate  set  is  therefore 
absolutely  consistent. 
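Verification against a finite incidence table is a mechanical matter, as the following sketch suggests. It makes two assumptions that should be stated plainly. First, the incidence structure used here is a standard seven-point construction (line i carries points i, i+1, i+3 modulo 7) and is not claimed to match the labeling of the table above. Second, the properties tested are the usual incidence properties of a finite projective plane, assumed here because the postulates of Section 4.5 are not reproduced in this section.

```python
from itertools import combinations

POINTS = range(7)

# Assumed incidence structure: line i carries the points i, i+1, i+3 (mod 7).
LINES = [frozenset({i % 7, (i + 1) % 7, (i + 3) % 7}) for i in range(7)]

# Any two distinct points lie on one and only one line.
two_points_one_line = all(
    sum(1 for line in LINES if p in line and q in line) == 1
    for p, q in combinations(POINTS, 2)
)

# Any two distinct lines meet in one and only one point.
two_lines_one_point = all(
    len(l & m) == 1 for l, m in combinations(LINES, 2)
)

# Every line carries three points, and every point lies on three lines.
three_points_per_line = all(len(line) == 3 for line in LINES)
three_lines_per_point = all(
    sum(1 for line in LINES if p in line) == 3 for p in POINTS
)
```

Each row of an incidence table for this structure would carry three asterisks, one for each line through the corresponding point.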

It  is  not  always  feasible  to  try  to  set  up  a  concrete  model  of  a  given 
postulate  set.  Thus,  if  the  postulate  set  contains  an  infinite  number  of 
primitive  elements,  a  concrete  model  would  certainly  be  impossible,  for  the 
real  world  does  not  contain  an  infinite  number  of  objects.  In  such  instances 
we  attempt  to  set  up  an  ideal  model,  by  assigning  to  the  primitive  terms  of 
postulate  system  A,  say,  concepts  of  some  other  postulate  system  B,  in  such 
a manner that the interpretations of the postulates of system A are logical
consequences of the postulates of system B. But now our test of consistency
of the postulate set A can no longer claim to be an absolute test, but only a
relative test. All we can say is that postulate set A is consistent if postulate
set B is consistent, and we have reduced the consistency of system A to that
of another system B.

Relative  consistency  is  the  best  we  can  hope  for  when  we  apply  the 
method  of  models  to  many  of  the  branches  of  mathematics,  for  many  of  the 
branches  of  mathematics  contain  an  infinite  number  of  primitive  elements. 
Consider,  as  an  illustration,  the  postulate  set  of  Section  6.2,  with  P4  replaced 
by  P'4:  K  consists  of  an  infinite  number  of  distinct  elements.  As  an  interpretation 
of this postulate set we might let the collection K be the set of all positive
integers and let the R-relation mean “is less than.” But the resulting model
is  an  ideal  one,  adapted  from  the  arithmetic  of  the  positive  integers,  and  all 
we  can  say  is  that  our  postulate  set  is  consistent  if  the  arithmetic  of  the 
positive  integers  is  consistent. 

The  idea  of  relative  consistency  was  encountered  when,  in  Chapter  Four, 
we  endeavored  to  show  the  consistency  of  Lobachevskian  plane  geometry. 
By  employing  certain  concepts  from  Euclidean  plane  geometry,  we  were  able 
to set up an ideal model known as the Poincaré model, and we succeeded in
showing  that  Lobachevskian  plane  geometry  is  consistent  if  Euclidean  plane 
geometry  is  consistent.  The  idea  of  analytic  geometry,  on  the  other  hand, 
sets  up  an  ideal  arithmetical  model  of  Euclidean  plane  geometry,  showing 
that  Euclidean  plane  geometry  is  consistent  if  the  real  number  system  is 
consistent.  It  also  follows,  then,  that  the  Lobachevskian  plane  geometry  is 
consistent  if  the  real  number  system  is  consistent.  In  this  way,  the  consistency 
of  many  branches  of  mathematics  can  be  reduced  to  the  consistency  of  some 
basic  branch,  which,  for  the  larger  part  of  mathematics  and  the  whole  of 
physics,  is  the  real  number  system.  This  important  idea  will  be  a  main  topic 
of  interest  in  the  next  chapter. 

Proof  of  consistency  by  the  method  of  models  is  an  indirect  process. 
It  is  conceivable  that  absolute  consistency  may  be  established  by  a  direct 
procedure  which  endeavors  to  show  that  by  following  the  rules  of  deductive 
inference  no  two  theorems  can  be  arrived  at  from  a  given  postulate  set  which 
will  contradict  each  other.  In  such  a  procedure  a  complete  enumeration  of 
the  permissible  rules  of  logic  is,  of  course,  necessary.  In  recent  years,  Hilbert 
attacked  the  problem  of  securing  the  consistency  of  the  real  number  system 
in  such  a  direct  manner,  but  with  only  partial  success.  Since  this  method 
depends  on  the  rules  of  logical  inference,  any  change  in  those  rules  could 
upset  a  previously  established  consistency  proof  of  this  sort.  An  advantage 
of  the  method  of  models  is  that  it  is  independent  of  the  “rules  of  the  game.” 

Although the direct method of establishing consistency is too complicated
to illustrate here, we can clarify the idea of the method by an analogue
in chess.* Suppose we wish to show that in a game of chess, no matter how
many moves might be made, one can never, if one plays in accordance with
the rules, arrive at the situation where there are 10 queens of the same color
on the board. Here the direct method is applicable, for we can prove from
the rules of the game that no move can increase the sum of the number of
queens and pawns of the same color. Since this sum is initially 9, it must
remain ≤ 9, and so ten queens of one color can never appear.
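
The chess argument is an invariant argument, and it can be sketched in Python. The abstraction below is our own assumption, not part of the source: a position is reduced to the pair (queens, pawns) for one color, and `successors` lists every way a single legal move can change that pair. The listing over-approximates real chess (not every state it produces need be reachable on a board), which is harmless, since we only need every real move to fall under one of the cases.

```python
from collections import deque

START = (1, 8)  # (queens, pawns) of one color at the start of a game


def successors(state):
    """Every way one move can change the (queens, pawns) count of a color."""
    q, p = state
    yield (q, p)              # quiet move, or capture of some other piece
    if q > 0:
        yield (q - 1, p)      # a queen is captured
    if p > 0:
        yield (q, p - 1)      # a pawn is captured, or promotes to rook, bishop, or knight
        yield (q + 1, p - 1)  # a pawn promotes to a queen


def reachable(start):
    """Breadth-first search over all count pairs obtainable from the start."""
    seen = {start}
    queue = deque([start])
    while queue:
        state = queue.popleft()
        for nxt in successors(state):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


STATES = reachable(START)
print(max(q + p for q, p in STATES))  # the invariant: never exceeds the initial sum
```

No case in `successors` increases the sum of queens and pawns, so the search can never leave the region where that sum is at most 9. The direct method for a postulate set proceeds in the same spirit, with rules of inference in place of moves.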

We  shall  describe  a  postulate  set  which  possesses  a  concrete  model  as 
being  verifiable.  Of  course  a  verifiable  postulate  set  is  absolutely  consistent, 
but  it  is  conceivable  (and  indeed  is  actually  a  known  fact)  that  an  absolutely 
consistent  postulate  set  need  not  be  verifiable. 


6.4 Properties of Postulate Sets: Independence and Completeness

In this section we shall consider the two properties of a postulate set
known as independence and completeness.† These two properties differ from
that  of  consistency,  considered  in  the  previous  section,  in  that  they  are  not 
required  properties  but  rather  are  properties  which,  as  we  shall  see,  are 
sometimes  desired  and  sometimes  not  desired.  We  henceforth  assume  that 
our  postulate  set  is  consistent,  for  there  is  little  point  in  studying  independence 
and  completeness  of  an  inconsistent  postulate  set. 

Independence.  A  postulate  of  a  postulate  set  is  said  to  be  independent  in  the 
set  if  it  is  not  a  logical  consequence  of  the  other  postulates  of  the  set,  and 
the  entire  postulate  set  is  said  to  be  independent  if  each  of  its  postulates  is 
independent. 

The  most  famous  consideration  in  the  history  of  mathematics  of  the 
independence  of  a  postulate  is  that  associated  with  the  study  of  Euclid’s 
parallel  postulate.  For  centuries  mathematicians  had  difficulty  in  regarding 
the  parallel  postulate  as  independent  of  Euclid’s  other  postulates  (and 
axioms),  and  accordingly  made  repeated  attempts  to  show  that  it  was  a 
consequence  of  these  other  assumptions.  It  was  the  discovery  of,  and  the 
ultimate  proof  of  the  relative  consistency  of,  Lobachevskian  non-Euclidean 
geometry that finally established the independence of Euclid’s parallel
postulate. In fact, it is no exaggeration to say that the historical consideration of
the  independence  of  Euclid’s  parallel  postulate  is  responsible  for  initiating 
the  entire  study  of  properties  of  postulate  sets  and  hence  for  the  shaping  of 
much  of  the  modern  axiomatic  method. 

The  manner  in  which  Euclid’s  parallel  postulate  was  finally  shown  to 
be  independent  furnishes  us  with  a  general  test  for  the  independence  of  a 

*See  H.  Weyl  [6],  pp.  23,  24. 

†These properties are given somewhat varying definitions by different writers.

postulate in a postulate set. The test consists in finding an interpretation
of the primitive terms of the postulate set which falsifies the concerned
postulate but which verifies each of the remaining postulates. If we are successful
in finding such an interpretation, then the concerned postulate cannot be a
logical consequence of the remaining postulates, for if it were a logical
consequence of the remaining postulates, then the interpretation which converts
all  the  other  postulates  into  true  propositions  would  have  to  convert  it  also 
into  a  true  proposition.  A  test,  along  these  lines,  of  the  independence  of  an 
entire  set  of  postulates  can  apparently  be  a  lengthy  business,  for  if  there  are 
n  postulates  in  the  set,  n  separate  tests  (one  for  each  postulate)  will  have  to 
be  formulated. 

Let us show that the postulate set studied in Section 6.2 is an
independent set.

To show the independence of Postulate P1, let us interpret K as consisting
of two brothers, their father, and their father’s father, and interpret
the R-relation as “is an ancestor of.” This interpretation verifies P2, P3,
and P4 but falsifies P1.

To  show  the  independence  of  Postulate  P2,  we  may  interpret  K  as  the 
set of integers 1, 2, 3, 4, and the R-relation as “is not greater than.” This
interpretation  verifies  each  postulate  except  P2. 

To show the independence of Postulate P3, let us interpret K as a set
of any four distinct elements and the R-relation as “is not identical with.”
Now  all  the  postulates  except  P3  are  verified. 

Finally,  to  show  the  independence  of  Postulate  P4,  one  may  interpret 
K as the set of five integers 1, 2, 3, 4, 5 and the R-relation as “is less than.”
All  the  postulates  except  P4  are  verified  by  this  interpretation. 
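
The four independence tests can be checked mechanically. The sketch below assumes that the postulates of Section 6.2, which is not reproduced here, read as follows: P1, if a and b are distinct then a R b or b R a; P2, if a R b then a and b are distinct; P3, if a R b and b R c then a R c; P4, K consists of exactly four distinct elements. The function names and the labels of the family members are hypothetical.

```python
from itertools import product


def p1(K, R):
    """P1 (assumed): distinct elements are related one way or the other."""
    return all(R(a, b) or R(b, a) for a, b in product(K, repeat=2) if a != b)


def p2(K, R):
    """P2 (assumed): R never relates an element to itself."""
    return all(a != b for a, b in product(K, repeat=2) if R(a, b))


def p3(K, R):
    """P3 (assumed): R is transitive."""
    return all(R(a, c) for a, b, c in product(K, repeat=3) if R(a, b) and R(b, c))


def p4(K, R):
    """P4 (assumed): K has exactly four distinct elements."""
    return len(set(K)) == 4


def profile(K, R):
    """Truth values of P1, P2, P3, P4 under the interpretation (K, R)."""
    return [p(K, R) for p in (p1, p2, p3, p4)]


# Interpretation falsifying P1 only: "is an ancestor of" on a small family.
FAMILY = ["brother1", "brother2", "father", "grandfather"]
ANCESTOR = {
    ("grandfather", "father"), ("grandfather", "brother1"),
    ("grandfather", "brother2"), ("father", "brother1"), ("father", "brother2"),
}

TESTS = {
    "P1": (FAMILY, lambda a, b: (a, b) in ANCESTOR),
    "P2": ([1, 2, 3, 4], lambda a, b: a <= b),    # "is not greater than"
    "P3": (list("wxyz"), lambda a, b: a != b),    # "is not identical with"
    "P4": ([1, 2, 3, 4, 5], lambda a, b: a < b),  # "is less than"
}

for name, (K, R) in TESTS.items():
    print(name, profile(K, R))
```

Each interpretation should falsify exactly the postulate under test and verify the other three, which is the pattern the general test for independence requires.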

Independence  of  a  postulate  set  is  by  no  means  necessary,  and  a  postulate 
set  clearly  is  not  invalidated  just  because  it  lacks  independence.  Generally 
speaking,  a  mathematician  prefers  a  postulate  set  to  be  independent,  for  he 
wants  to  build  his  theory  on  a  minimum  amount  of  assumption.  A  postulate 
set  which  is  not  independent  is  merely  redundant  in  that  it  contains  one  or 
more  statements  which  ought  perhaps  to  appear  as  theorems  instead  of  as 
postulates.  On  the  other  hand,  there  are  occasions  when  it  is  a  decided 
advantage  to  have  a  postulate  set  not  independent.  For  example,  in  the 
teaching  of  mathematics,  one  may  wish,  for  pedagogical  reasons,  to  develop 
a  subject  from  a  postulational  basis.  However,  as  sometimes  happens,  an 
early  theorem  in  the  development  may  be  very  difficult  to  prove.  This 
theorem  can  then  be  stated  as  one  of  the  postulates;  later,  when  the  students 
have  gained  the  requisite  mathematical  maturity  and  familiarity  with  the 
subject,  it  can  be  pointed  out  that  the  postulate  is  really  not  independent 
and  a  demonstration  of  it  from  the  other  postulates  can  be  given. 

There  are  some  well-known  postulate  sets  which,  when  first  published, 
unknowingly contained postulates that were not independent. Such was the
situation with Hilbert’s original set of postulates for Euclidean geometry.
This  set  was  later  shown  to  possess  two  postulates  which  are  implied  by  the 
others.  The  finding  of  these  two  dependent  postulates  in  no  way  invalidated 
Hilbert’s  system;  in  a  subsequent  amendment  these  postulates  were  merely 
changed  to  theorems,  and  their  proofs  supplied. 

Similarly,  R.  L.  Wilder  was  able  to  show  that  R.  L.  Moore’s  famous  set 
of  eight  postulates,  which  virtually  inaugurated  modern  set-theoretic 
topology,  could  be  reduced  to  seven  by  the  elimination  of  Moore’s  sixth 
postulate.  *  The  suspicion  that  the  sixth  postulate  was  not  independent  arose 
from  the  fact  that  the  independence  proof  for  this  postulate  was  found  to 
be  at  fault,  and  a  subsequent  search  for  a  satisfactory  proof  turned  out  to 
be  fruitless.  Of  course,  Moore’s  mathematical  theory  remained  intact  in 
spite  of  Wilder’s  discovery,  but  the  reduction  of  an  eight-postulate  system 
to  an  equally  effective  seven-postulate  system  has  an  aesthetic  appeal  to 
the  mathematician. 

Again,  however,  as  in  the  above  discussion  of  equivalence  of  postulate 
systems,  the  actual  number  of  postulates  used  as  a  basis  is  of  little  significance 
in connection with the property of independence. Any postulate set,
independent or not, can easily be converted into an independent set having but
one  postulate  by  the  simple  device  of  lumping  all  the  separate  postulates 
of  the  set  into  one  big  postulate  by  means  of  conjunctions. 

For  later  use  we  shall  describe  a  postulate  p  of  a  postulate  set  P  to  be 
strongly independent in P if both P and (P − p) + not-p are verifiable. From
the  above  discussion  we  realize  that  if  a  postulate  p  is  strongly  independent 
in  a  postulate  set  P,  then  p  is  independent  in  P,  but  it  is  conceivable  that  a 
postulate  p  may  be  independent  in  a  postulate  set  P  and  yet  not  be  strongly 
independent  in  P. 

Completeness.  The  property  of  completeness  is  more  recondite  than  the 
three  properties  already  described,  and  so  it  might  be  well  to  precede  the 
definition  of  completeness  with  some  motivation  for  it.  Consider  for  the 
moment,  then,  the  hypothetical  task  of  building  up  a  postulate  set  for  plane 
Euclidean  geometry.  The  first  thing  we  might  do  is  to  select  our  primitive 
terms.  These  must  constitute  a  collection  of  technical  terms  of  the  geometry 
such  that  all  other  technical  terms  of  the  study  can  be  defined  by  means  of 
them.  The  next  thing  we  might  do  is  to  begin  formulating  a  growing  list 
of  mutually  compatible  statements  about  the  primitive  terms.  These  will  be 
our  postulates.  Here  a  problem  arises.  When  can  we  cut  short  our  growing 
list  of  postulates?  We  want  our  postulate  set  to  be  ample  enough  to  imply 
the “truth” or the “falseness” of any possible statement in plane Euclidean
geometry.  In  other  words,  we  want  to  have  a  sufficient  number  of  postulates 
so  that  if  S  is  any  statement  whatever  concerning  the  primitive  terms,  then 

*R. L. Moore [2] and R. L. Wilder [1].

either S or its contradictory statement, not-S, is implied by our postulates.
If  the  postulate  set  is  not  sufficiently  ample,  there  will  certainly  be  some 
statements  of  the  geometry  which  cannot  be  reached  from  our  postulates. 
This  condition  would  exist,  for  example,  if  we  had  chosen  Hilbert’s  collection 
of  primitive  terms  and  all  of  Hilbert’s  postulates  except,  say,  the  parallel 
postulate. This slightly truncated postulate set could never decide for us
whether  or  not  the  sum  of  the  angles  of  a  triangle  is  always  equal  to  180°, 
for  our  truncated  postulate  set  is  common  to  the  postulate  sets  of  both 
Euclidean  and  Lobachevskian  plane  geometry.  Apparently  our  list  of 
postulates  will  be  sufficiently  ample  if  a  point  is  reached  when  it  becomes 
impossible  to  add  to  our  list  any  further  statement  which  will  be  both 
independent  of,  and  consistent  with,  the  postulates  so  far  formulated.  If 
such  a  point  can  be  reached,  then  we  have  what  may  be  called  a  complete 
postulate  set  for  the  geometry. 

We  may  now  give  the  following  formal  definition  of  completeness.  A 
consistent  postulate  set  is  said  to  be  complete  if  it  is  impossible  to  add  to  the 
set,  without  extending  the  collection  of  primitive  terms,  another  postulate 
which  is  both  independent  of,  and  consistent  with,  the  given  postulates. 

Closely  allied  to  the  concept  of  completeness  is  a  concept  which  we  shall 
call  weak  completeness.  A  postulate  set  P  will  be  said  to  be  weakly  complete  if 
it  is  impossible  to  add  to  the  set,  without  extending  the  collection  of  primitive 
terms,  another  postulate  p  that  is  strongly  independent  in  the  augmented 
set P + p. In other words, postulate set P is weakly complete if it is impossible
to add to P, without extending the collection of primitive terms, another
postulate p such that P + p and [(P + p) − p] + not-p = P + not-p are
both  verifiable.  It  should  be  clear  to  the  reader  that  weak  completeness  does 
not  necessarily  imply  completeness,  but  completeness  does  imply  weak 
completeness. 

The  definitions  of  completeness  and  weak  completeness,  like  that  of 
consistency,  are  not  easily  adapted  to  testing  a  postulate  set  for  the  property 
concerned.  A  test  of  consistency  was  devised  that  depends  on  the  concept  of 
verifiability  of  a  postulate  set.  A  test  for  weak  completeness  can  be  devised 
that  depends  upon  a  concept  called  categoricalness  of  a  postulate  set.  We  will 
proceed  now  to  explain  the  meaning  of  this  new  concept. 

Among the primitive terms of a postulate set P we have a collection
of E’s, say, which denote elements, perhaps some relations R₁, R₂, …
among the elements, and perhaps some operations O₁, O₂, … upon the
elements. Accordingly, an interpretation of the postulate set is composed
at least in part of element constants (the meanings assigned to the E’s),
perhaps in part of relation constants (the meanings assigned to the R’s),
and perhaps in part of operation constants (the meanings assigned to the O’s).
Now in any given interpretation I of P, let a collection of e’s be the element
constants (representing the E’s), r₁, r₂, … the relation constants
(representing the R’s), and o₁, o₂, … the operation constants (representing the
O’s); and in any other interpretation I' of P let the element constants be a
collection of e'’s, the relation constants r'₁, r'₂, …, and the operation
constants o'₁, …. If it is possible to set up a one-to-one correspondence
between the elements e of I and the elements e' of I' in such a way that, if
two or more of the e’s are related by some r, the corresponding e'’s are
related by the corresponding r', and if an o operating on one or more of the
e’s yields an e, the corresponding o' operating on the corresponding e'’s
yields the corresponding e', then we say that the two interpretations I and I'
of P are isomorphic. This definition is often more briefly stated by saying
that two interpretations I and I' of a postulate set P are isomorphic if one
can set up a one-to-one correspondence between the elements of I and those
of I' in such a way as “to be preserved by the relations and the operations
of P.” It follows that if two interpretations I and I' of a postulate set P are
isomorphic, then any true (false) proposition p in interpretation I becomes a
true (false) proposition p' in interpretation I' when we replace the e’s, r’s,
and o’s in p by their corresponding e'’s, r'’s, and o'’s. Two isomorphic
interpretations of a postulate set P are, except for superficial differences
in terminology and notation, identical; they differ from each other no more
than does the multiplication table up to 10 × 10 when correctly written
first in English and then in French.
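
For finite interpretations with a single dyadic relation and no operations, the definition can be tested by brute force. The Python sketch below is an illustration under that restriction; the function name `isomorphic` and the sample interpretations are our own. It requires the relation to be preserved in both directions, so that true and false propositions alike carry over.

```python
from itertools import permutations, product


def isomorphic(K1, R1, K2, R2):
    """Return a relation-preserving one-to-one correspondence K1 -> K2, or None.

    K1 and K2 are finite collections of elements; R1 and R2 are two-argument
    predicates. Every bijection is tried in turn.
    """
    K1, K2 = list(K1), list(K2)
    if len(K1) != len(K2):
        return None  # no one-to-one correspondence can exist
    for image in permutations(K2):
        f = dict(zip(K1, image))
        if all(R1(a, b) == R2(f[a], f[b]) for a, b in product(K1, repeat=2)):
            return f
    return None


# The same structure written "first in English and then in French".
VALUE = {"un": 1, "deux": 2, "trois": 3, "quatre": 4}
ENGLISH = ([1, 2, 3, 4], lambda a, b: a < b)
FRENCH = (list(VALUE), lambda a, b: VALUE[a] < VALUE[b])
DISTINCT = ([1, 2, 3, 4], lambda a, b: a != b)

print(isomorphic(*ENGLISH, *FRENCH))
print(isomorphic(*ENGLISH, *DISTINCT))
```

The search costs n! trials for n elements, so it serves only for small interpretations; it is meant to make the definition concrete, not to be efficient.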

With the notion of isomorphic interpretations of a postulate set
established, we are prepared to define categoricalness of a postulate set. A
postulate  set  P  is  said  to  be  categorical  if  every  two  interpretations  of  P 
are  isomorphic.  * 

We  shall  now  show  that  categoricalness  of  a  consistent  postulate  set  P 
implies  weak  completeness  of  P.  To  this  end  suppose  that  P  is  categorical 
but not weakly complete. Since P is not weakly complete there exists a
statement p, about the primitive terms of P, such that both P + p and
P + not-p are verifiable. That is, P + p possesses an interpretation I and
P + not-p possesses an interpretation I'. Since P is categorical, there is a
one-to-one correspondence between the elements of I and the elements of I'
such  that  corresponding  propositions  in  the  two  resulting  models  are  either 
both  true  or  both  false.  But  this  is  impossible,  since  p  is  a  true  proposition 
for  interpretation  I  but  a  false  proposition  for  interpretation  I'. 

In  practice  it  is  difficult  to  establish  completeness  of  a  set  of  postulates. 
To  show  incompleteness  one  has  merely  to  produce  two  nonisomorphic 
interpretations  of  the  postulate  set.  Sometimes  weak  completeness  can  be 
established by showing that any interpretation of the postulate set is
isomorphic to some given interpretation. This is the process that has been

*It  seems  that  the  adjective  categorical  was  introduced  into  the  literature  by  Veblen, 
who  received  the  suggestion  from  John  Dewey.  Huntington  had  earlier  employed 
the adjective sufficient, and Hilbert used the adjective complete.

applied to Hilbert’s postulate set for plane Euclidean geometry; it can be
shown  that  any  interpretation  of  Hilbert’s  postulates  is  isomorphic  to  the 
algebraic  interpretation  provided  by  Descartes’s  analytic  geometry.  In  this 
way  it  can  also  be  shown  that  the  simple  postulate  set  of  Section  6.2  is 
weakly  complete.  If,  on  the  other  hand,  we  delete  Postulate  P4  from  the 
postulate set of Section 6.2, then the remaining postulate set is incomplete.
This is readily seen by interpreting R as “is less than” and by taking K to
consist in turn of the three integers 1, 2, 3 and of the four integers 1, 2, 3, 4.
Each of these interpretations verifies Postulates P1, P2, and P3, but the
interpretations are not isomorphic, since it is impossible to set up a
one-to-one correspondence between the three elements of the one interpretation
and  the  four  elements  of  the  other  interpretation. 
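
Both claims of this paragraph can be explored by exhaustive search. The sketch below again assumes the wording of P1, P2, P3 used in Section 6.2 (comparability of distinct elements, irreflexivity, transitivity) and represents a relation as a set of ordered pairs. It lists every relation on the four integers 1, 2, 3, 4 that verifies P1, P2, P3 and compares each with “is less than”; since any interpretation verifying P4 has exactly four elements and is a relabelling of one on this set, finding them all isomorphic illustrates the categoricalness from which weak completeness follows.

```python
from itertools import permutations, product


def p1(K, R):
    """P1 (assumed): distinct elements are related one way or the other."""
    return all((a, b) in R or (b, a) in R for a in K for b in K if a != b)


def p2(K, R):
    """P2 (assumed): no element is related to itself."""
    return all(a != b for (a, b) in R)


def p3(K, R):
    """P3 (assumed): the relation is transitive."""
    return all((a, c) in R for (a, b) in R for (b2, c) in R if b == b2)


def models(K):
    """All relations on K, as sets of ordered pairs, verifying P1, P2, P3."""
    all_pairs = list(product(K, repeat=2))
    found = []
    for mask in range(2 ** len(all_pairs)):
        R = {pair for i, pair in enumerate(all_pairs) if mask >> i & 1}
        if p2(K, R) and p1(K, R) and p3(K, R):
            found.append(R)
    return found


def isomorphic(K1, R1, K2, R2):
    """True when some bijection K1 -> K2 carries R1 exactly onto R2."""
    if len(K1) != len(K2):
        return False
    for image in permutations(K2):
        f = dict(zip(K1, image))
        if {(f[a], f[b]) for (a, b) in R1} == R2:
            return True
    return False


K3, K4 = [1, 2, 3], [1, 2, 3, 4]
LESS3 = {(a, b) for a in K3 for b in K3 if a < b}
LESS4 = {(a, b) for a in K4 for b in K4 if a < b}
MODELS4 = models(K4)

print(len(MODELS4), all(isomorphic(K4, R, K4, LESS4) for R in MODELS4))
print(isomorphic(K3, LESS3, K4, LESS4))
```

With P4 deleted, the three-element and four-element interpretations both verify the remaining postulates, yet no one-to-one correspondence between them exists, so the truncated set is incomplete.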

There  are  advantages  and  disadvantages  in  having  a  postulate  set 
complete.  Perhaps  the  most  desirable  feature  of  an  incomplete  postulate  set 
is  its  wide  range  of  applicability.  This  feature  is  strikingly  seen  in  the  postulate 
set  of  Section  6.2,  where  the  deletion  of  Postulate  P4  yields  an  incomplete 
postulate  set  which  may  be  interpreted  for  any  n  sequentially  related  elements 
—  indeed,  we  may  even  take  n  infinite.  When  we  wish  to  study  a  structure 
that  is  common  to  a  number  of  more  complicated  structures,  an  incomplete 
postulate  system  is,  of  course,  necessary.  There  are,  for  example,  many 
structures  which  contain  the  group  as  a  substructure,  and  it  is  therefore 
convenient  and  economical  to  have  the  group  properties  worked  out  once 
and  for  all  from  an  appropriate  incomplete  postulate  set.  A  similar  remark 
can  be  made  about  the  postulate  sets  governing  such  important  structures 
as  fields,  rings,  simple  order,  equivalence,  and  so  on.  Frequently,  in  building 
up  a  given  theory  by  the  postulational  method,  it  is  desirable  to  introduce 
the  postulates  one  by  one  or  in  related  sets,  so  that  it  becomes  clear  just 
how  the  theorems  of  the  study  depend  upon  the  underlying  postulates.  * 
For  example,  the  theorems  common  to  Euclidean  and  Lobachevskian  plane 
geometry may be found from Hilbert’s postulate set with the parallel
postulate deleted. Such partial postulate sets are, of course, incomplete, and may
be  verified  by  nonisomorphic  interpretations.  On  the  other  hand,  one 
certainly  hopes  to  have  complete  postulate  sets  for  certain  significant  branches 
of  mathematics,  like  Euclidean  geometry  and  the  real  number  system  (which 
lies  at  the  basis  of  analysis).  If  p  is  any  statement  about  the  primitive  terms 
of such a postulate set, then of p and its contradictory statement, not-p, one
and  only  one  is  implied  by  the  postulate  set. 


6.5  Miscellaneous  Comments 

We shall conclude the present chapter with a few additional
miscellaneous comments about postulate sets and the modern axiomatic method.

*See, for example, D. Hilbert [1], H. G. Forder [1], and W. Sierpinsky.

There are properties of postulate sets other than those which we have
already  described,  which  have  been  considered  by  mathematicians.  Thus 
E.  H.  Moore  introduced,  in  1910,  a  property  that  he  called  “complete 
independence”  of  a  postulate  set.  *  Moore  defined  a  postulate  set  P  to  be 
completely independent if, for every proper subsystem Pᵢ, Pⱼ, …, Pₖ of the
postulates, both P and P with each of the postulates Pᵢ, Pⱼ, …, Pₖ replaced
by its contradictory statement are consistent. This definition is a
generalization of the definition of independence of a postulate set as given in Section
6.4. It can be shown that an independent postulate set is not necessarily
completely independent.†

Closely  related  to  the  concepts  of  independence  of  a  postulate  and  of  a 
postulate  set  are  the  concepts  of  relative  “weakness”  of  a  postulate  and  of  a 
postulate set. Given two postulates Pᵢ and P'ᵢ, we say that P'ᵢ is weaker than
Pᵢ (and Pᵢ is stronger than P'ᵢ) if P'ᵢ is implied by Pᵢ but Pᵢ is not implied by
P'ᵢ; and a postulate set containing P'ᵢ is then said to be weaker than the same
postulate set with P'ᵢ replaced by Pᵢ. It is interesting that of two equivalent
postulate  sets  it  is  possible  for  one  to  be  weaker  than  the  other.  For  example, 
we  gave,  toward  the  end  of  Section  5.1,  a  postulate  set  for  a  field.  At  the 
time,  we  pointed  out  that  the  postulate  set  given  there  is  somewhat  redundant, 
and  we  pointed  out  some  of  the  redundancies.  Nevertheless,  the  redundant 
postulate  set  and  the  allied  set  with  the  redundancies  removed  are  equivalent; 
we  merely  listed  a  stronger  postulate  set  than  was  needed.  The  idea  of 
relative  weakness  of  equivalent  postulate  sets  was  also  encountered  in 
Section  5.3  in  connection  with  our  postulate  set  for  a  group.  This  idea  of 
relative  weakness  of  pairs  of  equivalent  postulate  sets  is  often  useful  when  we 
consider  which  of  two  equivalent  postulate  sets  may  be  thought  the  simpler. 

It  is  interesting  to  note  that  the  concept  of  independence,  and  the  method 
of testing for independence, are applicable in the formulation of definitions.‡
Suppose,  for  example,  a  square  is  defined  as  “a  quadrilateral  which  is  both 
equilateral  and  equiangular.”  The  three  properties  in  the  definition— that 
of  being  a  quadrilateral,  of  being  equilateral,  and  of  being  equiangular — 
are consistent in Euclidean geometry, for a figure possessing all three
properties can be constructed. Furthermore, none of the three properties is
redundant,  for  there  exist  figures  in  Euclidean  geometry  possessing  any  two 
of  the  three  properties  but  lacking  the  remaining  one,  as  is  instanced  by  an 
equilateral  triangle  (which  possesses  the  second  and  third  properties,  but 
lacks  the  first),  an  oblong  (which  possesses  the  first  and  third  properties, 
but  lacks  the  second),  and  a  nonrectangular  rhombus  (which  possesses  the 
first  and  second  properties,  but  lacks  the  third).  In  contrast  to  the  above 
definition  of  a  square,  consider  the  definition  of  a  parallelogram  as  “a 

*E.  H.  Moore  [2],  p.  82. 

†For a simple example, see O. Helmer.

‡See E. R. Stabler [4], p. 158.

quadrilateral whose pairs of opposite sides are parallel and equal.” This
definition  also  involves  three  properties,  and  the  three  properties  are  certainly 
consistent in Euclidean geometry. But the three properties are not
independent, inasmuch as the property of equal opposite sides, for example, can
be proved from the property of parallel opposite sides. The definition, then,
although not incorrect, is undesirable because it contains redundancies and
is  therefore  unnecessarily  cumbersome. 
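
The independence test for the definition of a square can be carried out on concrete figures. In the Python sketch below, the coordinates of the four figures, the function names, and the floating-point tolerance are our own illustrative choices; each figure is assumed to be a convex polygon given by its vertices in order.

```python
import math


def sides(poly):
    """Side lengths of a polygon given by its vertices in order."""
    n = len(poly)
    return [math.dist(poly[i], poly[(i + 1) % n]) for i in range(n)]


def angles(poly):
    """Interior angles, in radians, of a convex polygon."""
    n = len(poly)
    out = []
    for i in range(n):
        (px, py), (qx, qy), (rx, ry) = poly[i - 1], poly[i], poly[(i + 1) % n]
        u = (px - qx, py - qy)  # from this vertex back to the previous one
        v = (rx - qx, ry - qy)  # from this vertex on to the next one
        cos_t = (u[0] * v[0] + u[1] * v[1]) / (math.hypot(*u) * math.hypot(*v))
        out.append(math.acos(max(-1.0, min(1.0, cos_t))))
    return out


def all_close(values):
    """True when all values agree up to a small tolerance."""
    return all(math.isclose(v, values[0], rel_tol=1e-9, abs_tol=1e-9) for v in values)


def properties(poly):
    """(is a quadrilateral, is equilateral, is equiangular)."""
    return (len(poly) == 4, all_close(sides(poly)), all_close(angles(poly)))


SQUARE = [(0, 0), (1, 0), (1, 1), (0, 1)]
TRIANGLE = [(0, 0), (1, 0), (0.5, math.sqrt(3) / 2)]  # equilateral triangle
OBLONG = [(0, 0), (2, 0), (2, 1), (0, 1)]             # nonsquare rectangle
RHOMBUS = [(0, 0), (5, 0), (8, 4), (3, 4)]            # nonrectangular rhombus

for name, fig in [("square", SQUARE), ("triangle", TRIANGLE),
                  ("oblong", OBLONG), ("rhombus", RHOMBUS)]:
    print(name, properties(fig))
```

The square exhibits the consistency of the three properties, and each of the other three figures plays the part of an interpretation that falsifies one property while verifying the remaining two.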

There  are  many  interesting  comments  that  can  be  made  in  connection 
with  the  postulate  property  of  completeness;  we  shall  here  content  ourselves 
with  but  one.*  It  should  be  noticed  that  the  concept  of  isomorphism  of 
interpretations  of  a  postulate  set  is  an  equivalence  relation.  Thus,  if  P  is  a 
postulate system and if I₁ and I₂ represent two interpretations of P, we may
let “I₁ ~ I₂” mean “I₁ and I₂ are isomorphic interpretations.” It is easy to
check the equivalence postulates, as given in Section 5.5, to find that they
are satisfied by this meaning of ~. It follows that we can decompose the
set of all interpretations of P into classes, putting two interpretations into
the same class if and only if they are isomorphic. Categoricalness of P
corresponds  to  the  condition  where  there  is  only  one  class  of  interpretations. 
In  the  case  of  noncategoricalness,  we  may  have  any  number  of  classes,  and 
it  may  be  that  the  number  of  such  classes  can  serve  as  a  quantitative  measure 
of  the  incompleteness  of  a  postulate  set. 

The  tests  that  we  have  discussed  for  consistency,  independence,  and 
weak  completeness  of  a  postulate  set  raise  some  deep  questions  which  have 
only  recently  been  at  least  partly  answered.  For  example,  might  not  a 
postulate  set  be  inconsistent  without  our  being  able  to  discover  the  fact? 
And,  on  the  other  hand,  might  not  a  postulate  set  be  consistent,  though  we 
should  never  know  it?  Can  a  consistent  postulate  set  containing  only  a  finite 
number  of  primitive  elements  exist  for  which  there  is  no  interpretation?  If 
such  a  consistent  postulate  set  should  have  no  known  interpretation  it 
nevertheless  can  be  thought  of  as  describing  the  structure  of  some  possible 
system  of  entities,  though  in  this  case  it  would  have  little  interest  in  applied 
mathematics.  In  connection  with  the  property  of  completeness  we  can  ask 
the  following  very  significant  question.  For  a  given  deductive  theory,  can 
one  find  a  procedure  for  deciding,  in  a  finite  number  of  explicitly  described 
steps,  whether  a  given  proposition  formulated  in  terms  of  the  technical  terms 
of  the  theory  is  or  is  not  within  the  theory?  In  other  words,  can  one  find  a 
procedure  to  decide  of  any  particular  proposition  whether  it,  its  denial,  or 
neither belongs to the theory? Questions like this have led in recent times
to the development of an intricate but important decision theory, into which
we  cannot  enter  here.  We  content  ourselves  by  merely  stating  that  the 
decision  problem  has  been  solved  for  some  deductive  theories  and  not  for 


*See R. L. Wilder [3], p. 48.

others; indeed, an argument has been developed which demonstrates that
there  is  a  large  class  of  deductive  theories  for  which  a  decision  procedure 
cannot  exist.  As  still  another  important  question  in  the  theory  of  postulate 
sets,  and  one  which  is  receiving  increased  attention,  we  might  mention  the 
following.  In  interpreting  a  postulate  set,  what  are  the  allowable  ranges  of 
the  various  variables  involved  in  the  propositional  functions?  Studies  related 
to  this  question  have  been  made  by  A.  N.  Whitehead  and  Bertrand  Russell 
in their so-called theory of logical types.* Different approaches have been
employed by others, notably W. V. Quine.† Without some limitation
imposed  on  these  ranges,  it  can  be  shown  that  paradoxes  may  creep  into 
one’s  work.  At  the  present  time  this  subject  is  in  a  state  of  uncertainty,  and 
it  definitely  is  not  a  matter  for  elementary  consideration. 
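
As a small illustration of what a decision procedure looks like, consider the classical propositional calculus, a deductive theory for which the decision problem is known to be solved: its theorems are exactly the tautologies, and a proposition in finitely many variables can be settled by running through its truth table. The Python sketch below is offered only as an illustration and is not part of the original discussion. The names classify and implies are hypothetical, and we assume, for convenience, that a proposition is represented as a Python function of truth values.

```python
from itertools import product

def implies(a, b):
    """Material implication: 'a implies b' is false only when a is true and b is false."""
    return (not a) or b

def classify(formula, num_vars):
    """Decide, in finitely many steps, the status of a propositional formula.

    Every assignment of truth values to the variables is tried in turn.
    Returns "theorem" if the formula is true under all assignments,
    "denial is a theorem" if it is false under all assignments,
    and "neither" otherwise.
    """
    values = [formula(*row) for row in product([False, True], repeat=num_vars)]
    if all(values):
        return "theorem"
    if not any(values):
        return "denial is a theorem"
    return "neither"

# Law of the excluded middle: true under every assignment.
print(classify(lambda p: p or not p, 1))
# A contradiction: its denial is a theorem.
print(classify(lambda p: p and not p, 1))
# A contingent proposition: neither it nor its denial is a theorem.
print(classify(lambda p, q: implies(p, q), 2))
```

The procedure always halts because a formula in n variables has only 2^n rows in its truth table. For the large class of theories mentioned above, no analogous finite procedure can exist.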

Some  of  the  above  matters  cause  little  or  no  trouble  in  connection  with 
most  of  the  postulate  sets  that  are  employed  by  mathematicians  and  others. 
The  reason  for  this  is  that  most  postulate  sets  originate  with  a  model  of  the 
abstract  system.  That  is,  in  most  cases,  the  individual  researcher  has  in  mind 
some  specific  model,  and  then  proceeds  to  build  a  postulate  system  that  will 
cover  his  model,  so  that,  in  practice,  the  model  or  interpretation  generally 
comes  first  and  the  postulate  set  later.  This  is  particularly  true  in  such  fields 
as  biology,  chemistry,  economics,  ethics,  law,  mechanics,  philosophy,  physics, 
zoology,  and  so  on,  where  someone  familiar  with  the  field  might  choose  to 
set  down  some  postulates  for  the  field,  or  for  some  part  of  the  field,  and  then 
see what theorems can be logically deduced from the postulates. Theoretically,
of course, there is no need for the model to precede the abstract
development,  though  it  must  be  admitted  that  this  is  the  usual  source  of  a 
postulate  set.  If  one  were  to  put  down  some  arbitrary  symbols  for  primitive 
terms  and  then  to  try  to  formulate  postulates  about  these  terms,  it  would  be 
difficult,  without  some  model  in  mind,  to  think  of  anything  to  say,  and, 
moreover,  one  would  have  to  take  great  care  to  insure  consistency  of  the 
postulates.  Sometimes  a  new  postulate  system  is  derived  from  a  given 
postulate  system  by  altering  one  or  more  postulates  of  the  given  system.  It 
was  in  this  way  that  postulate  systems  for  Lobachevskian  and  Riemannian 
non-Euclidean  geometries  were  derived  from  the  postulates  for  Euclidean 
geometry. 

*A. N. Whitehead, and B. Russell.

†W. V. Quine [2].

Note that there are, in general, three distinct levels in axiomatic study.
First of all, there are the more or less informal theories of specific fields of
knowledge. Secondly, there are the formal abstract postulational developments
having these specific fields as models. Thirdly, there is a theory which
studies the properties possessed by formal abstract postulational developments.
It is this third, and highest of the three levels, that Hilbert christened
metamathematics.*

There  is  one  very  important  feature  of  the  postulational  method  that 
we  have  as  yet  scarcely  touched  upon.  In  part  (4)  of  the  pattern  of  formal 
axiomatics, as outlined in Section 6.1, we note that the theorems of the discourse
must be logically deduced from the postulates of the discourse. That is,
a  formal  mathematical  organization  starts  with  a  set  of  postulates,  which  are 
then  expanded  into  a  more  or  less  extensive  mathematical  theory  through 
the  use  of  a  logic  or  set  of  rules  of  inference  accepted  as  an  adjunct  to  the 
theory.  In  a  sense,  the  use  of  a  logic  does  not  add  to  the  statements  of  the 
postulates,  but  merely  extracts  from  the  postulates  that  which  is  already  in  a 
way  implied  by  them  and  which  is  perhaps  hidden  by  them  in  a  subtle 
manner.  In  any  event,  however,  the  mathematical  theory  results  from  the 
interplay  of  two  factors,  a  set  of  postulates  and  a  logic. 

We  have  given  considerable  attention  to  the  first  of  the  two  factors  of  a 
logical  discourse,  namely,  the  postulates  of  the  discourse,  but  as  yet  we  have 
paid  scant  attention  to  the  second  factor,  namely,  the  logic  that  is  adopted 
for deducing the theorems of the discourse from the postulates of the discourse.
The logic employed by mathematicians is usually the classical logic
of Aristotle, developed by the Greek philosopher in rudimentary form in
the fourth century B.C. In fact, it is said that Aristotle received suggestions
for his science of deduction from a study of mathematical reasoning then in
vogue. Most people believe that logical thinking, according to the Aristotelian
tradition, is virtually instinctive in a person, whereas serious students
of the subject realize that the ability to construct valid arguments must largely
be developed through diligent effort. In truth, instructors of subjects requiring
a knowledge of deductive reasoning recognize that many students have
very  little  ability  in  obtaining  logical  consequences  from  given  hypotheses. 

*Perhaps the chief problems of metamathematics center around the permissible proof
paraphernalia of the axiomatic method so that consistency will be assured. For an
elementary discussion of some of the problems of metamathematics see, for example,
A. Tarski [2], Chap. 6. Also see R. Carnap [3] and S. C. Kleene [2].

Until quite recent times the classical logic of Aristotle was believed to
be the only possible system of deductive logic. If this were the case, then the
logical constituent of a logical discourse would be a constant one; that is,
it would be the same for all logical discourses. One of the achievements of
recent times is the realization that there is not just one system of deductive
logic but an unlimited number of possible systems. The significance of this
achievement to mathematics may in the future be very great, for whereas
it has been appreciated for some decades that the axiomatic constituent in
formal mathematical organization is a variable one, now the task of the
mathematician may be considerably complicated by the fact that the logical
constituent may also be variable. The Principia mathematica of Whitehead
and Russell employs a logic which is not precisely Aristotelian in nature, yet
the  deductions  made  there  are  acceptable.  Certainly  mathematicians  will 
do  well  to  stay  abreast  of  the  work  which  is  being  done  in  formal  logic. 
It  is  clear  that  our  own  neglect  of  the  logical  element  of  mathematics  must 
be  remedied.  We  shall  accordingly  return  to  this  matter  in  Chapter  Nine, 
after  further  preliminary  groundwork  has  been  laid.  Even  a  casual  reader 
of  these  pages  must  have  observed  the  dependence  of  the  properties  of 
postulate  sets  upon  such  laws  of  classical  logic  as  the  law  of  contradiction 
and  the  law  of  the  excluded  middle.  Should  a  logic  be  adopted  in  which 
these  laws  are  in  some  sense  questioned,  the  whole  treatment  of  consistency, 
independence,  and  completeness  of  a  postulate  set  may  need  recasting. 


PROBLEMS 


6.1.1  (a)  Construct  an  example  of  a  propositional  function  containing  three 
variables. 

(b)  Obtain,  by  appropriate  substitutions,  three  true  propositions  from 
the  propositional  function. 

(c)  Obtain,  by  appropriate  substitutions,  three  false  propositions  from 
the  propositional  function. 

(d)  Discuss  the  statement,  “God  is  love,”  in  terms  of  the  concepts 
proposition  and  propositional  function. 


6.1.2 In elementary mathematics, an equation in one variable x, say, can
be regarded as a propositional function in which the only admissible meanings
for x are complex numbers. A fundamental problem in elementary
mathematics is the determination of those complex numbers, if there are
such, for which the equation becomes a true proposition. What are the
verifying values of x for each of the following equations?*

(a) 3150 + 15x = 3456 − 33/2x.

(b) sin x = cos x.

(c) 2x² − 5x − 3 = 0.


(d)-(g) [Four further equations whose typeset forms are not recoverable from this copy. The legible fragments are: = 3 − x; + 3x + 6 = 0; x² − 9; x + 3; √x; + 2 − √(x² − 4x).]


6.1.3 Discuss the following definitions (or descriptions) of mathematics in
relation to the notion of the subject as an assemblage of hypothetical doctrinal
functions:†


*See  C.  V.  Newsom  and  H.  Eves,  Chap.  12. 

†All of these quotations, except the last, are taken with the permission of Mrs. R. E.
Moritz from R. E. Moritz, Memorabilia mathematica (New York: The Macmillan
Company, 1914. Reprinted, as On Mathematics and Mathematicians, by Dover Publications,
Inc., 1958). The number following each quotation is that assigned to it in
Moritz’s collection.

(a) The purely formal sciences, logic and mathematics, deal with those
relations  which  are,  or  can  be,  independent  of  the  particular  content  or  the 
substance  of  objects.  (Hermann  Hankel— 114) 

(b)  Perhaps  the  least  inadequate  description  of  the  general  scope  of 
modern  Pure  Mathematics— I  will  not  call  it  a  definition— would  be  to  say 
that  it  deals  with  form,  in  a  very  general  sense  of  the  term.  (E.  W.  Hobson 
-118) 

(c)  Mathematics  in  its  widest  signification  is  the  development  of  all 
types  of  formal,  necessary,  deductive  reasoning.  (A.  N.  Whitehead— 122) 

(d)  A  mathematical  science  is  any  body  of  propositions  which  is  capable 
of  an  abstract  formulation  and  arrangement  in  such  a  way  that  every 
proposition  of  the  set  after  a  certain  one  is  a  formal  logical  consequence  of 
some  or  all  the  preceding  propositions.  Mathematics  consists  of  all  such 
mathematical  sciences.  (J.  W.  Young— 124) 

(e) Pure mathematics is a collection of hypothetical, deductive theories,
each consisting of a definite system of primitive, undefined, concepts or symbols
and primitive, unproved, but self-consistent assumptions (commonly called
axioms) together with their logically deducible consequences following by
rigidly deductive processes without appeal to intuition. (G. D. Fitch - 125)

(f) Pure mathematics consists entirely of such asseverations as that, if
such  and  such  a  proposition  is  true  of  anything,  then  such  and  such  another 
proposition  is  true  of  that  thing.  It  is  essential  not  to  discuss  whether  the 
first  proposition  is  really  true,  and  not  to  mention  what  the  anything  is  of 
which it is supposed to be true. . . . If our hypothesis is about anything and
not  about  some  one  or  more  particular  things,  then  our  deductions  constitute 
mathematics.  Thus  mathematics  may  be  defined  as  the  subject  in  which  we 
never  know  what  we  are  talking  about,  nor  whether  what  we  are  saying  is 
true.  (Bertrand  Russell— 127) 

(g)  The  critical  mathematician  has  abandoned  the  search  for  truth. 
He  no  longer  flatters  himself  that  his  propositions  are  or  can  be  known  to 
him  or  to  any  other  human  being  to  be  true;  and  he  contents  himself  with 
aiming  at  the  correct,  or  the  consistent.  The  distinction  is  not  annulled  nor 
even  blurred  by  the  reflection  that  consistency  contains  immanently  a  kind 
of  truth.  He  is  not  absolutely  certain,  but  he  believes  profoundly  that  it  is 
possible to find various sets of a few propositions each such that the propositions
of each set are compatible, that the propositions of each such set imply
other  propositions,  and  that  the  latter  can  be  deduced  from  the  former  with 
certainty.  That  is  to  say,  he  believes  that  there  are  systems  of  coherent  or 
consistent  propositions,  and  he  regards  it  his  business  to  discover  such  systems. 
Any such system is a branch of mathematics. (C. J. Keyser - 132)

(h)  [Mathematics  is]  the  study  of  ideal  constructions  (often  applicable 
to  real  problems),  and  the  discovery  thereby  of  relations  between  the  parts 
of these constructions, before unknown. (C. S. Peirce - 133)

(i)  Mathematics  is  preferably  free  in  its  development  and  is  subject 
only to the obvious consideration, that its concepts must be free from contradictions
in themselves, as well as definitely and orderly related by means
of definitions to the previously existing and established concepts. (Georg
Cantor— 207) 

(j) Mathematicians assume the right to choose, within the limits of
logical  contradiction,  what  path  they  please  in  reaching  their  results. 
(Henry  Adams— 208) 

(k) . . . we cannot get more out of the mathematical mill than we put
into it, though we may get it in a form infinitely more useful for our purpose.
(John  Hopkinson— 239) 

(l) In Pure Mathematics, where all the various truths are necessarily
connected  with  each  other  (being  all  necessarily  connected  with  those 
hypotheses  which  are  the  principles  of  the  science),  an  arrangement  is  beautiful 
in  proportion  as  the  principles  are  few;  and  what  we  admire  perhaps  chiefly 
in  the  science,  is  the  astonishing  variety  of  consequences  which  may  be 
demonstrably  deduced  from  so  small  a  number  of  premises.  (Dugald  Stewart 
-242) 

(m)  The  mathematician  may  be  compared  to  a  designer  of  garments, 
who  is  utterly  oblivious  of  the  creatures  whom  his  garments  may  fit.  To  be 
sure,  his  art  originated  in  the  necessity  for  clothing  such  creatures,  but  this 
was  long  ago;  to  this  day  a  shape  will  occasionally  appear  which  will  fit 
into  the  garment  as  if  the  garment  has  been  made  for  it.  Then  there  is  no 
end of surprise and delight.*

6.2.1  (a)  Establish  the  following  theorem  of  the  simple  branch  of  pure 
mathematics  of  Section  6.2:  If  a  G  c  and  b  G  c,  then  a  =  b. 

(b) Define the triadic relation B(abc) to mean either (a R b and b R c)
or (c R b and b R a). Now prove: If B(abc) holds, then B(acb) does not hold.

(c)  What  is  the  meaning  of  B(abc)  in  the  three  applications  of  the 
branch  of  pure  mathematics  given  in  Section  6.2? 

6.2.2  Write  out  the  statements  of  the  theorems  and  definitions  of  the 
geometrical  application  of  the  branch  of  pure  mathematics  of  Section  6.2. 

6.2.3  Write  out  the  statements  of  the  theorems  and  definitions  of  the 
arithmetical  application  of  the  branch  of  pure  mathematics  of  Section  6.2. 

6.2.4 In the branch of pure mathematics of Section 6.2, let K be a set of
four  concentric  circles  of  different  radii,  and  let  R  mean  “is  contained 
within.”  Show  that  these  meanings  yield  an  interpretation  of  the  postulate 
set.  Write  out  the  statements  of  the  theorems  and  definitions  in  the  resulting 
model. 

6.3.1  Show  that  equivalence  of  postulate  sets  is  an  equivalence  relation 
as  defined  in  Section  5.5. 

6.3.2  Show  that  the  ten  postulates  for  a  field,  as  given  in  Section  5.1,  are 
equivalent  to  the  same  set  of  postulates  with  Postulates  P5  and  P9  replaced  by 

P'5: If a, b, c are in S, then a ⊗ (b ⊕ c) = (a ⊗ b) ⊕ (a ⊗ c).

P'9: If a, b, c are in S, c ≠ z, and c ⊗ a = c ⊗ b, then a = b.

*Quoted by permission from T. Dantzig, Number, the Language of Science (3rd ed.,
rev. and augmented; New York: The Macmillan Company, 1946), pp. 231-232.

6.3.3 Establish the absolute consistency of the postulate set for a field (as
given  in  Section  5.1). 

6.3.4  Which  of  the  following  proposed  postulate  sets  are  consistent? 

(a) The ten field postulates of Section 5.1 plus the statement, “For
every element a in S, a ⊕ a = z.”

(b) The ten field postulates of Section 5.1 plus the statement, “For
every element a in S, (a ⊕ a) ⊕ a = z.”

(c) The ten field postulates of Section 5.1 plus the statement, “For
every element a in S, a ⊗ a = a.”

(d) The ten field postulates of Section 5.1 plus the statement, “S contains
infinitely many elements.”

(e) The ten field postulates of Section 5.1 plus the statement, “S contains
only a finite number of elements.”

(f) The ten field postulates of Section 5.1 plus the statement, “If a
and b are elements of S, then a ⊗ b = a ⊕ b.”

(g) The ten field postulates of Section 5.1 plus the statement, “There
exist elements a and b of S such that a ≠ z, b ≠ z, and a ⊗ b = z.”

(h) The ten field postulates of Section 5.1 with Postulate P10 replaced
by P'10: For each element a in S there exists an element a⁻¹ in S such that
a ⊗ a⁻¹ = u.

6.3.5 If p, q, r represent propositions show that the following set of four
statements  is  inconsistent:  (1)  If  q  is  true,  then  r  is  false.  (2)  If  q  is  false,  then 
p  is  true.  (3)  r  is  true.  (4)  p  is  false. 

6.3.6  Compare  the  concept  of  consistency  and  inconsistency  of  a  set  of 
simultaneous  equations  with  the  concept  of  consistency  and  inconsistency 
of  a  postulate  set. 

6.3.7  Show  that  the  table  given  in  Section  6.3  establishes  the  absolute 
consistency  of  the  postulate  set  for  plane  projective  geometry  as  given  in 
Section  4.5. 

6.3.8 Let S be a set of elements and F a dyadic relation satisfying the
following postulates:

P1: If a and b are elements of S and if b F a, then a F̸ b.

P2: If a is an element of S, then there is at least one element b of S such that
b F a.

P3: If a is an element of S, then there is at least one element b of S such that
a F b.

P4: If a, b, c are elements of S such that b F a and c F b, then c F a.

P5: If a and b are elements of S such that b F a, then there is at least one
element c of S such that c F a and b F c.

Show that the statement, “If a is an element of S, then there is at least
one element b of S, distinct from a, such that b F̸ a and a F̸ b,” is consistent
with the above postulates.

(This set of postulates, augmented by the above statement, has been
used in relativity theory, where the elements of S are interpreted as instants
and F as meaning “follows.”*)

6.3.9 Let K be a set of elements and R a triadic relation, and consider
the following eight statements: (1) If a, b, c are elements of K, then at least
one of the six relations R(abc), R(bca), R(cab), R(cba), R(bac), R(acb) holds.
(2) There are at least three elements x, y, z of K such that R(xyz) holds.
(3) The two relations R(abc) and R(acb) cannot both hold. (4) If R(abc)
holds, then a, b, c are distinct. (5) If R(abc) holds, then R(bca) holds. (6) If
R(xab) and R(ayb) hold, then R(xay) holds. (7) If R(xab) and R(ayb) hold,
then R(xyb) holds. (8) If R(abc) holds and x is any other element of K, then
either R(abx) or R(xbc) holds.

(a)  Show  that  the  above  eight  statements  are  consistent. 

(b)  Show  that  the  following  four  sets  of  statements  are  equivalent  to 
one  another:  (1),  (3),  (4),  (5),  (6);  (1),  (3),  (4),  (5),  (7);  (1),  (3),  (4),  (5),  (8); 
(2),  (3),  (4),  (5),  (8).  (Each  of  these  four  sets  constitutes  a  postulate  set 
for  cyclic  order.) 

6.3.10  Consider  the  following  postulate  set,  in  which  bee  and  hive  are 
primitive  terms: 

P1: Every hive is a collection of bees.

P2:  Any  two  distinct  hives  have  one  and  only  one  bee  in  common. 

P3:  Every  bee  belongs  to  two  and  only  two  hives. 

P4:  There  are  exactly  four  hives. 

Show  that  this  set  of  postulates  is  absolutely  consistent. 
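
One way to approach a question of absolute consistency is to exhibit a small concrete interpretation and check each postulate against it directly. The Python sketch below is an illustration only and is not part of the original problem. The particular assignment of six numbered bees to four hives and the function name satisfies_postulates are assumptions chosen for the example, and other interpretations may serve equally well.

```python
from itertools import combinations

def satisfies_postulates(bees, hives):
    """Check P1 through P4 of Problem 6.3.10 for a finite candidate interpretation.

    bees  -- a set of objects playing the role of bees
    hives -- a collection of sets playing the role of hives
    """
    distinct = {frozenset(h) for h in hives}  # hives are compared as collections
    # P1: every hive is a collection of bees.
    p1 = all(h <= bees for h in distinct)
    # P2: any two distinct hives have one and only one bee in common.
    p2 = all(len(h & k) == 1 for h, k in combinations(distinct, 2))
    # P3: every bee belongs to two and only two hives.
    p3 = all(sum(b in h for h in distinct) == 2 for b in bees)
    # P4: there are exactly four hives.
    p4 = len(distinct) == 4
    return p1 and p2 and p3 and p4

# A candidate interpretation (assumed for illustration).
bees = {1, 2, 3, 4, 5, 6}
hives = [{1, 2, 3}, {1, 4, 5}, {2, 4, 6}, {3, 5, 6}]
print(satisfies_postulates(bees, hives))  # True
```

Because the interpretation is finite and every postulate can be verified by inspection, a check of this kind bears on absolute rather than relative consistency.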

6.3.11  Deduce  the  following  theorems  from  the  postulate  set  of  Problem 
6.3.10: 

(a)  There  are  exactly  six  bees. 

(b)  There  are  exactly  three  bees  in  each  hive. 

(c)  For  each  bee  there  is  exactly  one  other  bee  not  in  the  same  hive  with  it. 

6.4.1  Show  that  the  statement  in  quotation  marks  in  Problem  6.3.8  is 
independent  of  the  postulates  of  the  problem. 

6.4.2  (a)  Show  that  Postulates  P2,  P3,  P4  of  the  postulate  set  of  Problem 
6.3.10  are  independent. 

(b)  Show  that  the  postulate  set  of  Problem  6.3.10  is  categorical. 

6.4.3  Show  that  the  postulates  for  an  equivalence  relation,  as  given  in 
Section  5.5,  are  independent. 

6.4.4  Show  that  the  postulate  set  of  the  simple  branch  of  pure  mathematics 
in  Section  6.2  is  categorical. 

6.4.5  Clearly,  if  a  postulate  p  of  a  consistent  postulate  set  P  contains  a 
primitive  term  which  is  not  among  the  primitive  terms  occurring  in  any 
of  the  other  postulates  of  P,  then  p  is  independent. 

(a)  Using  the  above  principle,  show  that  the  additional  postulate,  “All 
hives  lie  in  the  same  apiary,”  where  apiary  is  a  primitive  term,  is  independent 
of  the  four  postulates  of  Problem  6.3.10. 


*A.  A.  Robb. 
(b) Also show the independence of the additional postulate by the
method  of  models. 

6.4.6 Which of the additional statements in parts (a) through (f) of Problem
6.3.4 are independent of the ten field postulates of Section 5.1?

6.4.7  Which  of  the  following  sets  of  postulates  are  categorical? 

(a)  The  ten  field  postulates  of  Section  5.1  plus  the  additional  postulate, 
“S  contains  an  infinite  number  of  elements.” 

(b)  The  ten  field  postulates  of  Section  5.1  plus  the  additional  postulate, 
“S  contains  only  a  finite  number  of  elements.” 

(c)  The  ten  field  postulates  of  Section  5.1  plus  the  additional  postulate, 
“S contains exactly two elements.”

6.4.8  Show  that  the  postulates  for  an  ordered  field,  as  given  in  Section 
5.1,  form  a  noncategorical  set. 

6.4.9  Consider  the  following  set  of  postulates  about  a  certain  collection  S  of 
primitive  elements  and  certain  primitive  subcollections  of  S  called  m-classes : 

P1: If a and b are distinct elements of S, then there is one and only one m-class
containing a and b.

D1: Two m-classes having no elements in common are called conjugate
m-classes.

P2: For every m-class there is one and only one conjugate m-class.

P3: There exists at least one m-class.

P4: Every m-class contains at least one element of S.

P5: Every m-class contains only a finite number of elements of S.

Establish the following theorems:

(a) T1: Every m-class contains at least two elements.

(b)  T2:  S  contains  at  least  four  elements. 

(c)  T3:  S  contains  at  least  six  m-classes. 

(d)  T4:  No  m-class  contains  more  than  two  elements. 

(e)  Prove  that  the  postulate  set  is  categorical. 

6.4.10 If, in the ten field postulates of Section 5.1, we replace P5 by P'5:
If a, b, c are in S, then a ⊕ (b ⊗ c) = (a ⊕ b) ⊗ (a ⊕ c), is the resulting
set of postulates consistent?

6.4.11  Construct  all  geometries  satisfying  the  following  postulates: 

P1: Space S is a set of n points, n a positive integer.

P2:  A  line  is  a  non-null  subset  of  S. 

P3:  Any  two  distinct  lines  have  exactly  one  common  point. 

P4:  Every  point  lies  in  exactly  two  distinct  lines. 

6.5.1  Explain  how  Moore’s  definition  of  complete  independence  of  a 
postulate  set  is  a  generalization  of  the  definition  of  independence  as  given 
in  Section  6.4. 

6.5.2  Criticize  the  following  possible  definitions  in  Euclidean  plane 
geometry: 

(a)  A  triangle  is  said  to  be  equilateral  if  its  three  sides  are  equal  and  if 
its  three  angles  are  equal. 
(b) A triangle is isosceles if two of its sides are equal and if the angles
opposite  these  sides  are  equal. 

(c)  A  rectangle  is  a  parallelogram  having  all  four  of  its  angles  right  angles. 

(d)  A  triangle  is  a  right  triangle  if  one  of  its  angles  is  a  right  angle  and 
the  other  two  angles  are  complementary. 

(e) The altitude upon the base of an isosceles triangle is the line perpendicular
to the base and bisecting the vertex angle of the triangle.

(f) A diameter of a circle is any straight line drawn through the center,
terminated  in  both  directions  by  the  circumference  of  the  circle,  and 
bisecting  the  circle. 

(g)  A  diameter  of  a  circle  is  a  maximum  chord  of  the  circle  passing 
through  the  center  of  the  circle. 

(h) Two triangles are said to be in perspective if the joins of corresponding
vertices of the two triangles are concurrent and the intersections of corresponding
sides of the two triangles are collinear.

6.5.3  Show  that  the  concept  of  isomorphism  of  interpretations  of  a  postulate 
set  is  an  equivalence  relation. 

6.5.4 Compare a branch of mathematics and its two constituents, a postulate
set and a logic, with a vector represented as the sum of two variable
components.

6.5.5 Let P be a postulate set composed of n postulates P1, P2, . . ., Pn.
Suppose there is an interpretation that verifies P1 but not P2, another
interpretation that verifies P1, P2, but not P3, . . ., another interpretation
that verifies P1, P2, . . ., P(n − 1), but not Pn. We then say the postulates
P, in the order given, are type-I sequentially independent. Does type-I sequential
independence imply independence?

6.5.6 Let P be a postulate set composed of n postulates P1, P2, . . ., Pn.
Suppose there is an interpretation that verifies P1 but none of the remaining
postulates, another interpretation that verifies P1 and P2 but none of the
remaining postulates, . . ., another interpretation that verifies P1, P2, . . .,
P(n − 1), but not Pn. We then say the postulates P, in the order given, are
type-II sequentially independent. Does type-II sequential independence imply
independence?

6.5.7  A  primitive  term  of  a  logical  theory  T  is  said  to  be  independent  of 
the  remaining  primitive  terms  of  T  if  no  definition  of  the  concerned  primitive 
term  in  terms  of  the  remaining  primitive  terms  can  be  given  which  is  provable 
in  T.  In  1899  Alessandro  Padoa  gave  the  following  method  for  establishing 
the  independence  of  a  primitive  term: 

Find  two  interpretations  of  T  such  that  the  meanings  assigned  to  the 
concerned  primitive  term  are  different  in  the  two  interpretations,  but  the 
meanings  assigned  to  the  remaining  primitive  terms  of  T  are  the  same  in 
the  two  interpretations. 

(a) Show that Padoa’s method serves its purpose.

(b)  Consider  the  theory  T  given  by  the  following: 
Primitive terms: A set S of elements a, b, c, . . . and two dyadic relations
R and D.

Postulates: P1. If a R b, then b R a.

P2. If a R b and b R c, then a R c.

P3. For each a there exists a b such that b R a.

P4. If a D b, then b D a.

Establish  the  independence  of  the  primitive  relation  D. 

6.5.8 A postulate P of a consistent postulate set S, it will be recalled, is
independent in S if the system (S − P) + not-P has an interpretation, and
S itself is independent if each of its postulates is independent. Also, S is completely
independent if for every proper subset M of S the system (S − M) +
not-M has an interpretation, where by not-M we mean the set composed
of the denials of all the members of M. Now in the above definition of an
independent  postulate  P,  it  is  only  necessary  to  have  an  interpretation  for 
which  P  does  not  always  hold.  Sometimes  it  is  possible  to  construct  an 
interpretation  for  which  all  the  other  postulates  hold  and  P  never  holds. 
In  this  case  we  say  that  postulate  P  is  very  independent  in  S,  and  S  is  said  to 
be  very  independent  if  each  of  its  postulates  is  very  independent.  Similarly,  S 
is said to be absolutely independent if for every proper subset M of S there is
an interpretation for which S − M holds but the postulates of M never hold.

(a)  Show  that  the  following  is  a  very  independent,  indeed  an  absolutely 
independent,  postulate  set  for  a  dyadic  relation  R  to  be  an  equivalence 
relation. 

P1: R is reflexive.

P2:  R  is  symmetric. 

P3:  R  is  distinctly  transitive.  (See  Problem  5.5.12.) 

(b)  Show  that  the  postulate  set  for  a  group  as  given  in  Section  5.3 
is  not  very  independent. 

(c)  Show  that  the  parallel  postulate  of  Euclidean  geometry  is  very 
independent. 


CHAPTER SEVEN

THE REAL NUMBER SYSTEM


7.1 The Significance of the Real Number System for the Foundations of Analysis

Mathematicians  have  divided  the  great  bulk  of  existing  mathematics 
into three large categories: geometry, algebra, and analysis. It is not always
easy  to  place  a  given  branch  of  mathematics  unfalteringly  into  one  of  these 
categories,  for  the  categories  lack  clear-cut  definitions.  This  is  perhaps  to  be 
expected,  since  there  is  no  agreement  even  as  to  what  mathematics  itself  is. 
Roughly,  however,  the  classification  is  determined  somewhat  as  follows. 
The  study  of  space  inaugurated  by  the  ancient  Greeks,  and  so  elegantly 
summarized by Euclid in his Elements, is called geometry, together with the
multitude of variations, generalizations, and associated studies that have
since been created. Among the variations are such studies as the Lobachevskian
and the Riemannian non-Euclidean geometries, and the non-Archimedean
and non-Desarguesian geometries; among the generalizations
are  the  higher  dimensional  geometries,  Riemann’s  broad  classes  of  geometries, 
and  instances  of  Klein’s  idea  of  a  geometry  as  the  invariant  theory  of  a 
definite transformation group; and among the associated studies are the
finite geometries, geometries of elements other than points, and various
abstract  theories  of  sets  of  elements,  usually  called  points,  with  sets  of  relations 
in  which  these  points  are  involved.  The  abstract  symbolic  study  of  ordinary 
arithmetic  is  called  algebra,  with  its  many  variations,  generalizations,  and 
associated  studies.  Thus  such  studies  as  those  of  groups,  rings,  and  fields  are 
called algebra, as are the studies of such hypercomplex numbers as quaternions
and matrices. Finally, analysis consists of those branches of mathematics
allied to and arising from the calculus. Such studies as the calculus
itself,  the  theory  of  differential  equations,  function  theory,  the  theory  of 
infinite  series,  generalized  theories  of  integration,  the  calculus  of  variations, 
and  so  on,  come  under  this  third  category.  The  idea  of  a  limit  plays  an 
important  part  in  each  of  these  latter  branches  of  mathematics;  generally 
the  presence  of  the  limit  concept  distinguishes  analysis  from  algebra. 

It  must  be  confessed  that  the  classification  of  bodies  of  mathematics 
into  the  three  categories  of  geometry,  algebra,  and  analysis  is  largely  based 
on  more  or  less  sentimental  and  traditional  grounds,  and  that  the  boundaries 
between  the  categories  are  becoming  less  and  less  well  defined.  There  are 
topics  of  mathematics  upon  which  there  is  little  agreement  regarding  the 
category  into  which  they  should  be  placed.  Thus  topology,  which  in  its  early 
development  was  recognized  as  geometric  and  was  classifiable  within  Klein’s 
program,  has  since  developed  into  a  body  of  mathematics  which  perhaps 
can  be  placed  into  any  one  of  the  three  categories;  indeed,  it  seems  quite 
possible that in the future topology may even be characterized as constituting
an independent fourth division of mathematics. Nevertheless, in spite
of such uncertain and borderline cases, the classification of most of mathematics
into the three categories of geometry, algebra, and analysis is reasonably
sound, for most branches of mathematics have ultimate parenthood in
either Euclidean geometry, symbolized arithmetic, or the limit processes
of the calculus.

In  earlier  chapters  we  have  seen  the  importance  of  the  two  categories 
of  geometry  and  algebra  insofar  as  the  foundations  and  the  fundamental 
concepts  of  mathematics  are  concerned,  and  how,  in  particular,  these  two 
categories  influenced  the  development  of  the  modern  axiomatic  method, 
until today each category has become a great bundle of associated postulational
systems. Has the third category, that of analysis, similarly contributed
to  the  foundations  and  the  fundamental  concepts  of  mathematics,  and  has  it, 
too,  become  deeply  penetrated  by  the  postulational  method?  We  shall  see 
that  the  story  of  analysis  is  somewhat  different  from  that  of  geometry  and 
algebra. 

The  vital  contributions  of  geometry  and  algebra  to  the  foundations  and 
fundamental  concepts  of  mathematics  arose  when  it  was  discovered  that 
there  are  geometries  and  algebras  other  than  the  traditional  ones.  Thus  it 
was the discovery of non-Euclidean Lobachevskian geometry and the discovery
of noncommutative quaternion algebra that initiated a profound
study  of  the  foundations  and  basic  concepts  of  these  fields  of  mathematics. 
It  was  not  a  similar  discovery  of  a  new  and  radically  different  kind  of  analysis 
that  led  to  a  corresponding  study  of  the  foundations  and  basic  concepts  of 
the  third  category  of  mathematics;  instead,  it  was  the  gradual  emergence  of 
contradictions  within  the  discipline  itself.  These  contradictions  revealed 
the  existence  of  some  grave  underlying  defects,  the  removal  of  which  required 
a  careful  and  penetrating  re-examination  of  the  foundations  upon  which  the 
development rests. We shall see that the effort to establish a rigorous foundation
for analysis proved to have perhaps an even more profound and
revolutionary  effect  on  mathematics  as  a  whole  than  did  the  corresponding 
task  in  geometry  and  algebra,  owing  largely  to  the  fact  that  analysis  had  to 
come  to  serious  grips  with  the  old  and  difficult  concept  of  infinity. 

Of  the  many  remarkable  mathematical  discoveries  that  were  made 
in  the  seventeenth  century,  and  which  rendered  that  century  outstandingly 
productive  in  the  development  of  the  subject,  unquestionably  the  most 
remarkable  was  the  invention  of  the  calculus,  toward  the  end  of  the  century, 
by  Isaac  Newton  (1642-1727)  and  Gottfried  Wilhelm  Leibniz  (1646-1716). 
The  new  tool  proved  to  be  almost  unbelievably  powerful  in  its  astonishingly 
successful  disposal  of  hosts  of  problems  which  had  been  baffling  and  quite 
unassailable  in  earlier  days.  Its  general  methods  were  able  to  cope  with  such 
matters  as  lengths  of  curved  arcs,  planar  areas  bounded  by  quite  arbitrary 
curves, surface areas and volumes of all sorts of solids, intricate maximization
and minimization problems, all kinds of problems involving related
rates of change, geometrical questions about tangents and normals,
asymptotes, envelopes, and curvature, and physical questions about work,
energy, power, pressure, centers of gravity, inertia, and gravitational attraction.
It was natural that this wide and amazing applicability of the new
subject  should  attract  mathematical  researchers  of  the  day,  and  that  papers 
should be turned out in great profusion with seemingly little concern regarding
the very unsatisfactory foundations of the subject. It was much more
exciting to apply the marvelous new tool than to examine its logical soundness,
for, after all, the processes employed justified themselves to the
researchers  in  view  of  the  fact  that  they  worked. 

Although  for  almost  a  hundred  years  after  the  invention  of  the  calculus 
by  Newton  and  Leibniz  little  serious  work  was  done  to  strengthen  logically 
the  underpinning  of  the  rapidly  growing  superstructure  of  the  calculus,  it 
must  not  be  supposed  that  there  was  no  criticism  of  the  existing  weak  base. 
Long  controversies  were  carried  on  by  some  mathematicians,  and  even  the 
two founders themselves were dissatisfied with their accounts of the fundamental
concepts of the subject. One of the ablest criticisms of the faulty
foundations  came  from  a  nonmathematician,  the  eminent  metaphysician 
Bishop George Berkeley (1685-1753), who insisted that the development
of the calculus involved the logical fallacy of a shift in the hypothesis. Let
us  clarify  this  particular  criticism  by  considering  one  of  Newton’s  approaches 
to  what  is  now  called  differentiation. 

In his Quadrature of Curves of 1704, Newton determines the derivative
(or fluxion, as he called it) of $x^3$ as follows. We here paraphrase Newton's
treatment:

In the same time that $x$, by growing, becomes $x + o$, the power $x^3$ becomes
$(x + o)^3$, or

$$x^3 + 3x^2 o + 3x o^2 + o^3,$$

and the growths, or increments,

$$o \quad\text{and}\quad 3x^2 o + 3x o^2 + o^3$$

are to each other as

$$1 \quad\text{to}\quad 3x^2 + 3xo + o^2.$$

Now let the increments vanish, and their last proportion will be 1 to $3x^2$,
whence the rate of change of $x^3$ with respect to $x$ is $3x^2$.

The  shift  of  hypothesis  to  which  Bishop  Berkeley  objected  is  evident;  in 
part  of  the  argument  o  is  assumed  to  be  nonvanishing,  while  in  another 
part  it  is  taken  to  be  zero.  Replies  were  made  to  Bishop  Berkeley’s  criticism, 
but,  without  a  logically  rigorous  treatment  of  limits,  the  objection  could  not 
be  well  met.  Alternative  approaches  proved  to  be  no  less  confusing.  Indeed, 
all the early explanations of the processes of the calculus are obscure, encumbered
with difficulties and objections, and not easy to read. Some of the
explanations  border  on  the  mystical  and  the  absurd,  as  the  statement  by 
Johann  Bernoulli  that  “a  quantity  which  is  increased  or  decreased  by  an 
infinitely  small  quantity  is  neither  increased  nor  decreased.” 
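
Newton's ratio of increments can be examined numerically. The following is only a sketch, using exact rational arithmetic; the function name `increment_ratio` and the sample value x = 2 are our own choices for illustration. It shows that the ratio equals $3x^2 + 3xo + o^2$ for every nonvanishing o and comes as close as we please to $3x^2$, while for o = 0 the ratio is simply undefined. This is the distinction that a theory of limits was later to make precise.

```python
from fractions import Fraction

def increment_ratio(x, o):
    """Ratio of the increment of x**3 to the increment o of x (o must be nonzero)."""
    if o == 0:
        raise ZeroDivisionError("the ratio of increments is undefined when o = 0")
    return ((x + o) ** 3 - x ** 3) / o

x = Fraction(2)
# Ratios for o = 1/10, 1/100, ..., 1/100000.
ratios = [increment_ratio(x, Fraction(1, 10 ** k)) for k in range(1, 6)]
# Each ratio is exactly 3x^2 + 3xo + o^2, so its gap from 3x^2 is 3xo + o^2.
gaps = [r - 3 * x ** 2 for r in ratios]
```

The gaps are all positive and shrink with o, yet none of them is zero; the value $3x^2$ is approached, never attained by substitution.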

When  the  theory  of  a  mathematical  operation  is  only  poorly  understood, 
there  is  the  danger  that  the  operation  will  be  applied  in  a  blindly  formal  and 
perhaps  illogical  manner.  The  performer,  not  aware  of  possible  limitations 
upon  the  operation,  is  likely  to  use  the  operation  in  instances  where  it  does 
not  necessarily  apply.  Instructors  of  mathematics  see  mistakes  of  this  sort 
made by their students almost every day. Thus one student of elementary
algebra, firmly convinced that $a^0 = 1$ for all real numbers $a$, will set $0^0 = 1$,
while another such student will assume that the equation $ax = b$ always
has exactly one real solution for each pair of given real values $a$ and $b$.
Again, a student of trigonometry may think that the formula

$$\sqrt{1 - \sin^2 x} = \cos x$$

holds for all real $x$. A student of the calculus, not aware of improper integrals,
may  get  an  incorrect  result  by  apparently  correctly  applying  the  rules  of 
formal  integration,  or  he  may  arrive  at  a  paradoxical  result  by  applying  to  a 
certain convergent infinite series some rule which holds only for absolutely
convergent infinite series. This is essentially what happened in analysis
during the century following the invention of the calculus. Attracted by the
powerful applicability of the subject, and lacking a real understanding of
the foundations upon which the subject must rest, mathematicians manipulated
analytical processes in an almost blind manner, often being guided
only by a native intuition of what was felt must be valid. A gradual accumulation
of absurdities was bound to result, until, as a natural reaction to the
pell-mell employment of intuition and formal manipulation, some conscientious
mathematicians felt impelled to attempt the difficult task of
establishing  a  rigorous  foundation  under  the  subject. 
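
Two of the student errors mentioned above can be made concrete with a short computation. The sketch below is illustrative only; the helper names `naive_identity_gap` and `solutions_of_linear` are hypothetical. It checks the trigonometric formula at a point where the cosine is negative, and it lists the cases of the equation ax = b that the hasty student overlooks.

```python
import math

def naive_identity_gap(x):
    """Difference between sqrt(1 - sin^2 x) and cos x at the point x."""
    # max(...) guards against a tiny negative value caused by rounding.
    return math.sqrt(max(0.0, 1.0 - math.sin(x) ** 2)) - math.cos(x)

# The formula agrees with cos x where cos x >= 0 ...
gap_at_zero = naive_identity_gap(0.0)
# ... but the left side is |cos x|, so the formula fails where cos x < 0.
gap_at_pi = naive_identity_gap(math.pi)

def solutions_of_linear(a, b):
    """Describe the real solutions of the equation a*x = b."""
    if a != 0:
        return "exactly one"
    return "infinitely many" if b == 0 else "none"
```

At x = 0 the two sides agree, while at x = π they differ by 2; and the equation ax = b has exactly one solution only when a is not zero.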

The  work  of  the  great  Swiss  mathematician  Leonhard  Euler  (1707-1783) 
represents the outstanding example of eighteenth-century formal manipulation
in analysis. It was by purely formal devices that Euler discovered the
remarkable formula

$$e^{ix} = \cos x + i \sin x,$$

which, for $x = \pi$, yields

$$e^{i\pi} + 1 = 0,$$

a  relation  connecting  five  of  the  most  important  numbers  of  mathematics. 
By  formal  manipulation  Euler  arrived  at  an  enormous  number  of  curious 
relations like

$$i \log_e i = -\frac{\pi}{2}$$

and succeeded in showing that any nonzero real number $n$ has an infinite
number of logarithms (for a given base), all imaginary if $n < 0$ and all
imaginary but one if $n > 0$. The beta and gamma functions of advanced
calculus,  and  many  other  topics  in  analysis,  similarly  originated  with  Euler. 
He  was  a  most  prolific  writer  on  mathematics.  His  name  is  attached  to 
practically  every  branch  of  the  subject,  and  it  is  of  interest  to  note  that  his 
amazing productivity was not in the least impaired when he had the misfortune
to become totally blind. It was his remarkable mathematical intuition
that generally held him on the right path, but there are, nevertheless,
numerous instances where Euler's formal manipulation led him into absurdities.
For example, if the binomial theorem is applied formally to $(1 - 2)^{-1}$,
we find

$$-1 = 1 + 2 + 4 + 8 + 16 + \cdots,$$
a result which caused Euler no wonderment! Also, by adding the two series

$$x + x^2 + \cdots = \frac{x}{1 - x} \quad\text{and}\quad 1 + \frac{1}{x} + \frac{1}{x^2} + \cdots = \frac{x}{x - 1},$$

Euler found that

$$\cdots + \frac{1}{x^2} + \frac{1}{x} + 1 + x + x^2 + \cdots = 0.$$

Seventeenth- and eighteenth-century mathematicians had little understanding
of infinite series, and this field of analysis furnished many paradoxes.
Thus  consider  the  series 

$$S = 1 - 1 + 1 - 1 + 1 - 1 + 1 - 1 + 1 - 1 + \cdots.$$

If we group the terms of this series in one way, we have

$$S = (1 - 1) + (1 - 1) + (1 - 1) + (1 - 1) + \cdots = 0 + 0 + 0 + 0 + \cdots = 0,$$

while, if we group the terms in another way, we have

$$S = 1 - (1 - 1) - (1 - 1) - (1 - 1) - (1 - 1) - \cdots = 1 - 0 - 0 - 0 - 0 - \cdots = 1.$$

Leibniz argued that since the sums 0 and 1 are equally probable, the correct
sum of the series is the average value 1/2. This value, too, can be obtained
in a purely formal manner, for we have

$$S = 1 - (1 - 1 + 1 - 1 + 1 - \cdots) = 1 - S,$$

whence $2S = 1$, or $S = 1/2$.
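
The source of the paradox becomes visible if we compute partial sums instead of regrouping terms. The sketch below is our own illustration, not part of the historical argument; the names `partial_sums` and `running_averages` are hypothetical. The partial sums alternate between 1 and 0 and so approach no limit. Averaging the partial sums, a device now known as Cesàro summation and brought in here only as an aside, yields values that settle toward Leibniz's 1/2.

```python
from fractions import Fraction
from itertools import accumulate

def partial_sums(n):
    """First n partial sums of the series 1 - 1 + 1 - 1 + ..."""
    terms = [(-1) ** k for k in range(n)]
    return list(accumulate(terms))

def running_averages(values):
    """Average of the first k values, for k = 1, ..., len(values)."""
    totals = accumulate(values)
    return [Fraction(t, k) for k, t in enumerate(totals, start=1)]

sums = partial_sums(10)            # alternates 1, 0, 1, 0, ...
averages = running_averages(sums)  # exact rational averages of the partial sums
```

Neither grouping of the series is justified, since the sequence of partial sums has no limit; the value 1/2 arises only under a different, explicitly stated definition of "sum".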

The  first  suggestion  of  a  real  remedy  for  the  unsatisfactory  state  of  the 
foundations  of  analysis  came  from  Jean-le-Rond  d’Alembert  (1717-1783), 
who  very  correctly  observed  in  1754  that  a  theory  of  limits  was  needed,  but 
a  sound  development  of  this  theory  was  not  forthcoming  until  1821.  The 
earliest  mathematician  of  the  first  rank  actually  to  attempt  a  rigorization 
of  the  calculus  was  the  Italian  mathematician  Joseph  Louis  Lagrange 
(1736-1813).  The  attempt,  based  upon  representing  a  function  by  a  Taylor’s 
series  expansion,  was  far  from  successful,  for  it  ignored  necessary  matters  of 
convergence and divergence. It was published in 1797 in Lagrange's monumental
work, Théorie des fonctions analytiques. Lagrange was one of the leading
mathematicians  of  the  eighteenth  century,  and  his  work  had  a  deep  influence 
on  later  mathematical  research;  with  Lagrange’s  work  the  long  and  difficult 
task  of  banishing  intuition  and  blind  formal  manipulation  from  analysis 
had  begun. 

In  the  nineteenth  century,  the  superstructure  of  analysis  continued  to 
rise,  but  on  ever-deepening  foundations.  A  debt  is  undoubtedly  owed  to 
Carl  Friedrich  Gauss,  whom  we  have  already  met  in  connection  with  the 
development  of  non-Euclidean  geometry,  for  Gauss,  more  than  any  other 
mathematician  of  his  time,  broke  from  intuitive  ideas  and  set  new  high 
standards of mathematical rigor. Also, in a treatment of hypergeometric
series made by Gauss in 1812, we encounter what is generally regarded as the
first  really  adequate  consideration  of  the  convergence  of  an  infinite  series. 

A great forward stride was made in 1821, when the French mathematician
Augustin-Louis Cauchy (1789-1857) successfully executed d'Alembert's
suggestion, by developing an acceptable theory of limits and then
defining  continuity,  differentiability,  and  the  definite  integral  in  terms  of  the 
limit  concept.  It  is  essentially  these  definitions  that  we  find  in  the  more 
carefully  written  of  today’s  elementary  textbooks  on  the  calculus.  The 
limit  concept  is  certainly  indispensable  for  the  development  of  analysis,  for 
convergence  and  divergence  of  infinite  series  also  depend  upon  this  concept. 
Cauchy’s  rigor  inspired  other  mathematicians  to  join  the  effort  to  rid 
analysis  of  formal  manipulation  and  intuition. 

The  demand  for  an  even  deeper  understanding  of  the  foundations  of 
analysis  was  strikingly  brought  out  in  1874,  when  there  was  exhibited  an 
example,  earlier  originated  by  the  German  mathematician  Karl  Weierstrass 
(1815-1897),  of  a  continuous  function  having  no  derivative,  or,  what  is  the 
same  thing,  a  continuous  curve  possessing  no  tangent  at  any  of  its  points. 
This  example  dealt  a  severe  blow  to  the  employment  of  geometric  intuition 
in  analytical  studies. 

The theory of limits, upon which the ideas of continuity and differentiability
depend, had been built previously upon a simple intuitive geometrical
notion of the real number system. Indeed, the real number system had been
taken more or less for granted, as it still is in most of our elementary calculus
texts. It became apparent that the theory of limits, continuity, and differentiability
depend upon more recondite properties of the real number system
than  had  been  supposed.  Other  ties  connecting  analysis  with  properties  of  the 
real  number  system  became  evident  when  Riemann  found  that  Cauchy  had 
unnecessarily  restricted  his  definition  of  a  definite  integral;  Riemann  showed 
that  definite  integrals  as  limits  of  sums  exist  even  when  the  integrands  are 
discontinuous.  In  elementary  calculus  these  integrals  are  referred  to  as 
improper  integrals  and  require  some  care  in  their  handling.  Riemann  also 
produced  a  function  which  is  continuous  for  all  irrational  values  of  the 
variable  but  discontinuous  for  all  rational  values.  Examples  such  as  these 
made  it  increasingly  apparent  that  Cauchy  had  not  struck  the  true  bottom 
of  the  difficulties  in  the  way  of  a  sound  foundation  for  analysis;  beneath 
everything  still  lay  properties  of  the  real  number  system  that  required  urgent 
understanding.  Accordingly,  Weierstrass  advocated  a  program  wherein  the 
real  number  system  itself  should  first  be  rigorized;  then  all  the  basic  concepts 
of  analysis  should  be  derived  from  this  number  system.  This  remarkable 
program, known as the arithmetization of analysis, proved to be difficult and
intricate, but was ultimately realized by Weierstrass and his followers, so
that today it can be fairly said that classical analysis has been firmly established
upon the real number system as a foundation.

Weierstrass died in 1897, just one hundred years after the first publication,
in 1797 by Lagrange, of an attempt to rigorize the calculus. Weierstrass
was  a  very  influential  teacher,  and  his  meticulously  prepared  lectures  and  his 
program  for  arithmetizing  analysis  established  a  new  ideal  for  analysts. 
Much  of  the  remainder  of  this  book  will  be  devoted  to  fundamental  ideas 
in mathematics brought out by late nineteenth-century and early twentieth-century
studies of the real number system. The next few sections will be
devoted  to  an  examination  of  the  real  number  system  itself. 


7.2 The Postulational Approach to the Real Number System

It  was  pointed  out  in  the  last  section  that  Weierstrass  and  his  followers, 
through  their  program  of  arithmetizing  analysis,  established  the  real  number 
system  as  the  foundation  for  the  whole  of  classical  analysis.  Euclidean 
geometry,  through  its  analytical  interpretation  as  considered  in  Section  4.4, 
can  also  be  made  to  rest  upon  the  real  number  system,  and  mathematicians 
have  shown  that  most  other  branches  of  geometry  are  consistent  if  Euclidean 
geometry  is  consistent.  Again,  since  the  real  number  system,  or  some  part 
of  it,  can  serve  for  interpreting  so  many  branches  of  algebra,  it  appears  that 
the  consistency  of  a  good  deal  of  algebra  can  also  be  made  to  depend  upon 
that  of  the  real  number  system.  We  have  seen  that  a  branch  of  mathematics 
which  possesses  an  interpretation  involving  only  a  finite  number  of  primitive 
elements may be shown to be absolutely consistent, whereas relative consistency
is the best that can be hoped for, by models, when the branch of
mathematics  possesses  no  finite  interpretation.  It  is  in  these  latter  situations, 
like  that  of  Euclidean  geometry,  that  the  real  number  system  has  so  frequently 
come  to  the  mathematicians’  assistance,  so  that  today  it  can  be  stated  that 
essentially  all  existing  mathematics  is  consistent  if  the  real  number  system 
is  consistent.  Herein  lies  the  tremendous  importance  of  the  real  number 
system  for  the  foundations  of  mathematics.  It  is  no  wonder  that  the  real 
number  system  has  received  intensive  study  in  recent  times,  and  that  this 
study  has,  as  we  shall  see  in  the  next  two  chapters,  unearthed  some  new  and 
even deeper foundational problems. For the present we shall concern ourselves
with two current methods of approaching the study of the real number
system,  devoting  this  section  to  the  postulational  approach  and  the  remaining 
sections  of  the  chapter  to  the  so-called  definitional,  or  genetical,  approach. 

If  the  bulk  of  mathematics  is  to  be  derived  from  the  real  number  system, 
it  is  desirable  that  the  real  number  system  itself,  which  mathematicians  had 
long  taken  intuitively  for  granted,  be  developed  in  a  logical  fashion.  One 
way  of  attempting  this  is  by  the  postulational  method.  We  might  choose  the 
real  numbers,  together  with  some  operations  on  them,  as  primitive  terms, 
and then hope to set down as postulates a number of the intuitive properties
of the real numbers and the primitive operations that should be sufficient
to  characterize  the  system.  Now  it  was  observed  in  Section  5.1  that  the  real 
number  system  is  an  example  of  an  ordered  field.  But  the  real  number  system 
is  not  characterized  by  this  description,  for  the  rational  number  system, 
which  is  not  isomorphic  with  the  real  number  system,  also  is  an  ordered 
field.  It  might  seem,  however,  that  our  intuitive  notion  of  the  real  number 
system  could  perhaps  be  characterized  by  the  postulates  of  an  ordered 
field  augmented  by  some  additional  restrictions.  Such  is  indeed  the  case, 
although  the  characterization  also  holds  for  systems  isomorphic  with  the 
real  number  system,  and  it  is  interesting  that  the  addition  of  only  one  more 
property  to  those  constituting  the  postulates  for  an  ordered  field  will  suffice 
for  the  purpose.  We  shall  refer  to  the  additional  property  as  the  postulate 
of  continuity,  and  we  shall  define  a  complete  ordered  field  to  be  an  ordered 
field  which  further  satisfies  the  postulate  of  continuity. 

It  will  be  convenient  to  reproduce  here,  with  a  few  simplifications,  the 
postulate  set  for  an  ordered  field  that  was  given  earlier  in  Section  5.1.  We 
retain,  for  pedagogical  reasons,  the  two  redundant  postulates  PI  and  P9; 
for  a  derivation  of  them  from  the  other  postulates  of  a  field,  see  Appendix, 
Section A.2.

An ordered field is a set S of elements, with two binary operations on the
set, here denoted by $\oplus$ and $\otimes$, satisfying the following postulates:

P1: If a and b are in S, then $a \oplus b = b \oplus a$.

P2: If a and b are in S, then $a \otimes b = b \otimes a$.

P3: If a, b, c are in S, then $(a \oplus b) \oplus c = a \oplus (b \oplus c)$.

P4: If a, b, c are in S, then $(a \otimes b) \otimes c = a \otimes (b \otimes c)$.

P5: If a, b, c are in S, then $a \otimes (b \oplus c) = (a \otimes b) \oplus (a \otimes c)$.

P6: S contains an element z (zero) such that for any element a of S, $a \oplus z = a$.

P7: S contains an element u (unity), different from z, such that for any element
a of S, $a \otimes u = a$.

P8: For each element a in S there exists an element $\bar{a}$ in S such that $a \oplus \bar{a} = z$.

P9: If a, b, c are in S, $c \neq z$, and $a \otimes c = b \otimes c$, then a = b.

P10: For each element $a \neq z$ in S, there exists an element $a^{-1}$ in S such that
$a \otimes a^{-1} = u$.

P11: There exists a subset P, not containing z, of the set S such that if $a \neq z$,
then one and only one of $a$ and $\bar{a}$ is in P.

P12: If a and b are in P, then $a \oplus b$ and $a \otimes b$ are in P.

Definition 1: The elements of P are known as the positive elements of S;
all other nonzero elements of S are known as negative elements of S.

Definition 2: If a and b are elements of S and if $a \oplus \bar{b}$ is positive,
then we write $a \ogreaterthan b$ and $b \olessthan a$ (the circled symbols > and <), and we say
that a is greater than b and that b is less than a.

Definition 3: An element a of S is called an upper bound of a nonempty
collection M of elements of S if, for each element m of M, either $m \olessthan a$
or m = a.

Definition 4: An element a of S is called a least upper bound of a nonempty
collection M of elements of S if a is an upper bound of M and if
$a \olessthan b$ whenever b is any other upper bound of M.

We now define a complete ordered field (sometimes called a linear continuum)
to be an ordered field which further satisfies

P13 (the postulate of continuity): If a nonempty collection M of elements
of S has an upper bound, then it has a least upper bound.*

Our  intuitive  notion  of  the  real  number  system  satisfies  all  the  postulates 
of  a  complete  ordered  field.  Now  mathematicians  have  shown  that  any  two 
possible  interpretations  of  a  complete  ordered  field  must  be  isomorphic 
with  each  other;  in  other  words,  mathematicians  have  shown  that  if  we 
assume  the  postulate  set  for  a  complete  ordered  field  to  be  consistent,  then 
this  postulate  set  is  also  categorical.  Of  course,  the  matter  of  consistency  of 
the  postulate  set  still  remains  a  fundamental  question.  We  cannot  offer  our 
intuitive  concept  of  the  real  number  system  as  an  interpretation  acceptable 
for  establishing  consistency  of  the  postulate  set  for  a  complete  ordered  field 
any  more  than  we  can  offer  our  intuitive  concept  of  points  and  lines  as  an 
acceptable  interpretation  of  the  postulate  set  for  Euclidean  geometry,  for 
an  intuitive  concept  may  involve  inconsistencies.  The  question  of  consistency 
of  the  postulate  set  for  a  complete  ordered  field  is  obviously  an  important 
one,  for  upon  the  settlement  of  this  question  rests  the  consistency  of  the  great 
bulk  of  mathematics.  We  shall  return  to  this  matter  in  later  sections. 

Since, apart from isomorphisms, there can exist at most one interpretation
of a complete ordered field, there is no harm in simply defining such
a possible interpretation to be the real number system. Accordingly, we shall
henceforth think of the elements of a complete ordered field as the familiar
set of all real numbers, and we shall employ the symbols $+$, $\times$, $>$, $<$
(without  circles  around  them)  in  their  familiar  senses. 

Part  of  the  significance  of  the  postulate  of  continuity  can  be  seen  from 
the  fact  that  the  rational  number  system,  which  satisfies  all  the  postulates 
of  an  ordered  field,  does  not  satisfy  this  additional  postulate.  Consider,  for 
example,  the  collection  M  of  all  positive  rational  numbers  whose  squares 
are  less  than  2.  The  rational  number  3  is  certainly  an  upper  bound  of  this 
collection M, but there is no rational number which is a least upper bound
of M. The only number which could be a least upper bound of M is $\sqrt{2}$,
and this, we know, is not a rational number.
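
One half of this argument can be carried out in exact rational arithmetic. The following sketch rests on our own choice of construction (the function name `smaller_upper_bound` and the averaging step are illustrative assumptions, not taken from the text). It shows that any positive rational q with $q^2 > 2$ can be replaced by a smaller rational whose square still exceeds 2, so that no such q can be a least upper bound of M.

```python
from fractions import Fraction

def smaller_upper_bound(q):
    """Given a positive rational q with q*q > 2, return a smaller rational
    that is still an upper bound of M = {positive rationals r with r*r < 2}."""
    # The average of q and 2/q is less than q, and its square minus 2
    # equals (q - 2/q)**2 / 4, which is positive.
    return (q + 2 / q) / 2

q = Fraction(3)
bounds = [q]
for _ in range(4):
    q = smaller_upper_bound(q)
    bounds.append(q)
```

Starting from the upper bound 3 we obtain 11/6, 193/132, and so on, each a rational upper bound of M smaller than the one before. The remaining cases are disposed of separately: a positive rational whose square is less than 2 is not an upper bound of M at all, and no rational has square equal to 2.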

The  full  significance  of  the  postulate  of  continuity,  however,  only 
appears  in  the  use  made  of  it  to  derive  many  important  theorems  of  the 

*Essentially this set of postulates, augmented with many redundancies, was given
by E. V. Huntington in 1911. See E. V. Huntington [10] (Bibliography).

calculus.* Let us illustrate this use by deriving with its aid the so-called
Archimedean  law  of  real  numbers  and  Dedekind’s  theorem  of  continuity. 
We  shall  assume  some  of  the  simpler  properties  of  the  real  number  system 
which  the  system  possesses  by  virtue  of  the  fact  that  it  is  an  ordered  field. 
In  particular,  we  shall  assume  that  the  familiar  integers  are  embedded 
among  the  real  numbers,  and  that  “between”  any  two  distinct  real  numbers 
there  lies  at  least  one  other  real  number.  It  would  take  us  too  far  afield 
to  derive  all  these  needed  properties. 

The  Archimedean  Law  of  Real  Numbers:  If  a  and  b  are  any  two 
positive  real  numbers ,  there  exists  a  positive  integer  n  such  that  na  >  b. 

Suppose the law is not true, and that na ≤ b for all positive integers n.
Denote by M the collection of all numbers of the form na. Then, by hypothesis,
b is an upper bound of M, and hence, by the postulate of continuity,
M has a least upper bound k. That is, na ≤ k for all positive integers
n, and there is no number less than k for which this is true. It follows that
(n + 1)a = na + a ≤ k for all positive integers n, or that na ≤ k − a
for all positive integers n. But then k − a, which is less than k because a is
positive, is an upper bound of M, and this contradicts the fact that k is a
least upper bound of M. The desired result now follows by reductio ad
absurdum.
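
The law can be tried out numerically. The following is only a sketch: exact rational numbers stand in for the real numbers, and the function name archimedean_witness is introduced here for illustration. It searches for the least positive integer n with na > b; the search ends precisely because the law guarantees that such an n exists.

```python
from fractions import Fraction


def archimedean_witness(a, b):
    """Return the least positive integer n such that n*a > b.

    a and b are positive rationals, used here as stand-ins for
    positive real numbers (an assumption of this sketch).
    """
    a, b = Fraction(a), Fraction(b)
    if a <= 0 or b <= 0:
        raise ValueError("a and b must be positive")
    n = 1
    while n * a <= b:  # the Archimedean law says this loop must end
        n += 1
    return n
```

For instance, with a = 1/3 and b = 5 the multiples a, 2a, 3a, ... stay at or below 5 up to 15a = 5, so the first n returned is 16.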

Dedekind’s Theorem of Continuity: If all the real numbers are separated
into two collections, which we denote by L and R, in such a way that (1) every real
number is either in L or in R, (2) L and R each contain at least one real number,
(3) every real number in L is less than every real number in R, then there is a real
number c such that all real numbers less than c are in L and all real numbers greater
than c are in R.

Since  the  collection  L  of  real  numbers  has  an  upper  bound,  namely, 
any  real  number  of  the  collection  R,  it  has  a  least  upper  bound  c.  Let  m 
be  any  real  number  less  than  c;  then  m  must  lie  in  L,  for  otherwise  m  would 
lie  in  R  and  thereby  be  an  upper  bound  of  L  which  is  less  than  the  least  upper 
bound  c.  On  the  other  hand,  let  n  be  any  real  number  greater  than  c;  then 
n  must  lie  in  R,  since  no  number  in  L  exceeds  c.  This  completes  the  proof 
of  the  theorem. 

The  reader  has  undoubtedly  recognized  in  the  Archimedean  law  of 
real  numbers  and  in  Dedekind’s  theorem  of  continuity  arithmetized  forms 

*An  interesting  venture  is  the  construction  of  an  elementary  calculus  textbook 
based  upon  the  above  13  postulates  for  the  real  number  system.  To  save  time,  one 
can  assume  the  needed  consequences  of  the  first  12  postulates.  This  should  cause 
the  student  no  difficulty,  for,  since  the  first  12  postulates  are  those  of  ordinary 
algebra,  he  should  already  be  familiar  with  the  needed  consequences,  though  not 
with their derivations. The basic concepts of the calculus depend upon the remaining
postulate (the postulate of continuity) and these concepts are far different from
those  of  algebra;  these  concepts  should  be  derived.  That  such  a  venture  is  perfectly 
feasible  is  attested  by  the  fact  that  it  has  already  been  successfully  accomplished. 
See  E.  G.  Begle. 
of the two important geometrical statements respectively referred to earlier
in  the  book  as  the  postulate  of  Archimedes  and  as  Dedekind’s  postulate  of 
continuity.  A  very  interesting  fact  is  that  not  only  can  Dedekind’s  theorem 
of  continuity  be  deduced  from  the  postulate  of  continuity  (by  employing 
properties  of  an  ordered  field),  but  the  postulate  of  continuity  can  similarly 
be  deduced  from  the  Dedekind  theorem  of  continuity.  In  other  words,  the 
Dedekind theorem of continuity and the postulate of continuity are equivalent
statements, and either one can be used as Postulate P13 in the list of
postulates  for  the  real  number  system.  Let  us  now  complete  the  present 
section  by  deducing  the  postulate  of  continuity  from  Dedekind’s  theorem 
of  continuity. 

Let  M  be  a  nonempty  collection  of  real  numbers  having  an  upper 
bound  k.  Choose  L  as  the  collection  of  all  real  numbers  which  are  less  than 
at least one number of M, and let R consist of all the rest of the real numbers.
We  now  show  that  L  and  R  satisfy  the  three  conditions  of  the  Dedekind 
continuity  theorem. 

1.  The  collections  L  and  R  contain  all  the  real  numbers;  this  follows  from 
the  definition  of  collection  R. 

2.  L  is  not  empty,  for,  inasmuch  as  M  contains  at  least  one  number  m, 
the  number  m  —  1  belongs  to  L.  R  is  not  empty,  for  it  contains  k. 

3. Every number of L is less than every number of R. For suppose there
is a number b₁ of R which is less than a number a₁ of L. Since there
exists a number m of M greater than a₁, we have b₁ < a₁ < m, and
b₁ must belong to L. This contradicts the hypothesis that b₁ is a number
of R.

Thus  the  three  conditions  of  the  Dedekind  continuity  theorem  hold, 
and  the  theorem  guarantees  the  existence  of  a  real  number  c  such  that  all 
real  numbers  less  than  c  are  in  L  and  all  real  numbers  greater  than  c  are 
in R. Now suppose there is a number m₁ of M such that m₁ > c. Then
between c and m₁ there would be a real number d which would be less than
m₁ and which would therefore belong to L, a situation which is impossible,
since every real number greater than c belongs to R. It follows that every
number of M is less than, or at most equal to, c, and that c is therefore an
upper bound of M. Next, let e be any real number less than c. Then e belongs
to  L,  and,  by  the  definition  of  collection  L,  there  exists  a  number  m  of  M 
such  that  m  >  e.  Hence  no  number  less  than  c  is  an  upper  bound  of  M. 
We  have  now  shown  that  any  nonempty  collection  of  real  numbers  having  an 
upper  bound  also  has  a  least  upper  bound;  this  is  the  postulate  of  continuity. 
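
The construction of L from M can be imitated on a small scale. In the sketch below, which is an illustration only, M is taken to be the collection of numbers whose squares are less than 2, rational test points stand in for real numbers, and the names in_L and locate_cut are introduced here. Bisection between a member of L and a member of R closes in on the number c of Dedekind's theorem, which for this M is the least upper bound √2.

```python
from fractions import Fraction


def in_L(q):
    """q is in L when q is less than at least one number of M,
    where M = {m : m*m < 2} (the example collection assumed here)."""
    return q <= 0 or q * q < 2


def locate_cut(lo, hi, steps):
    """Bisect between lo (in L) and hi (in R).

    Returns a pair (lo, hi) with lo in L and hi in R; the cut
    number c lies between them.
    """
    lo, hi = Fraction(lo), Fraction(hi)
    if not in_L(lo) or in_L(hi):
        raise ValueError("need lo in L and hi in R")
    for _ in range(steps):
        mid = (lo + hi) / 2
        if in_L(mid):
            lo = mid  # mid is still below some number of M
        else:
            hi = mid  # mid belongs to R
    return lo, hi
```

Each step halves the gap, so after twenty steps starting from 1 and 2 the two returned rationals differ by 1/2²⁰ and have squares on opposite sides of 2.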

7.3 The Natural Numbers and the Principle of Mathematical Induction

From  the  postulate  set  for  the  real  number  system  one  can,  by  simply 
making appropriate definitions in terms of the real numbers (as was indicated
in Section 4.4), arrive at an interpretation of Euclidean geometry
and  thus  make  the  consistency  of  Euclidean  geometry  depend  upon  that  of 
the  real  number  system.  This  suggests  that  perhaps  a  similar  undertaking 
can  be  carried  out  relative  to  the  real  number  system  itself.  That  is,  perhaps 
we  can  start  from  some  postulate  set  more  basic  than  that  for  the  real 
number  system,  and,  by  merely  making  appropriate  definitions,  arrive  at  an 
interpretation  of  the  real  number  system.  Thus  the  consistency  of  the  real 
number  system  would  depend  upon  the  consistency  of  the  more  fundamental 
postulate  set.  This  turns  out  to  be  possible.  In  the  remaining  sections  of  this 
chapter,  therefore,  we  shall  sketch  a  manner  in  which  the  real  number  system 
may  be  arrived  at  by  definition  from  a  postulate  set  for  the  much  simpler 
and  more  basic  system  of  natural  numbers,  the  numbers  of  counting.  In  this 
way  we  shall  make  the  consistency  of  the  great  bulk  of  mathematics  depend 
upon  that  of  the  very  fundamental  system  of  natural  numbers.  The  success 
of  carrying  through  the  program  is  credited  to  late  nineteenth-century 
researches  by  Peano,  Dedekind,  and  Cantor,  and  its  accomplishment  has 
given  the  mathematician  a  considerable  feeling  of  security  concerning  the 
consistency  of  most  of  mathematics.  This  attitude  follows  from  the  fact  that 
the  natural  number  system  seems  to  have  an  intuitive  simplicity  lacking  in 
most  other  mathematical  systems,  and  the  natural  numbers  have  been  very 
extensively  handled  over  a  long  period  of  time  without  producing  any  known 
inner  contradictions. 

Let  us  in  this  section  consider  a  postulate  set  for  the  system  of  natural 
numbers,  and  then  in  the  following  sections  show  how,  by  definitions  in 
terms  of  the  natural  numbers,  we  can  introduce  first  the  set  of  all  integers, 
next  the  set  of  all  rational  numbers,  and  then  the  set  of  all  real  numbers. 
We  here  choose  for  our  postulate  set  for  the  system  of  natural  numbers  one 
which  bears  marked  similarity  to  some  of  our  earlier  postulate  sets. 

For our primitive, or undefined, terms we take a set N of elements
called natural numbers, together with two binary operations on the set, called
addition and multiplication and denoted by + and ×, satisfying the following
ten postulates.

N1: If a and b are in N, then a + b = b + a.

N2: If a and b are in N, then a × b = b × a.

N3: If a, b, c are in N, then (a + b) + c = a + (b + c).

N4: If a, b, c are in N, then (a × b) × c = a × (b × c).

N5: If a, b, c are in N, then a × (b + c) = (a × b) + (a × c).

N6: There exists a natural number 1 such that a × 1 = a for all a in N.

N7: If a, b, c are in N and if c + a = c + b, then a = b.

N8: If a, b, c are in N and if c × a = c × b, then a = b.

N9: For given a and b in N, one and only one of the following holds: a = b,
a + x = b, a = b + y, where x and y are in N.

N10: If M is a set of natural numbers such that (1) M contains the natural
number 1, (2) M contains the natural number k + 1 whenever it contains the natural
number k, then M contains all the natural numbers.

The first five postulates are exact counterparts of the first five postulates
for the real number system; they postulate the commutativity and associativity
of the two binary operations of addition and multiplication and the
distributivity of multiplication over addition. N6 postulates the existence of
a multiplicative identity. N7 and N8 grant the cancellation laws for addition
and multiplication. These two postulates and Postulates N1, N2, N4 are
actually redundant, inasmuch as they all can be derived from the remaining
Postulates N3, N5, N6, N9, and N10, but all are retained here to simplify
the student’s experience in the subsequent deduction of theorems. The reader
can find derivations of N1, N2, N4, N7, and N8 from the remaining five
postulates in the Appendix, Section A.2. It is Postulate N9 that introduces
the idea of order among the natural numbers, for, if we have the situation
a + x = b, we write a < b or b > a, and say a is less than b or b is greater than a.

Postulate N10 is an extremely interesting postulate and merits more
extended comment. It is known as the postulate of finite induction, and it leads
to some theorems which are useful for establishing certain propositions
that involve all natural numbers. One of these theorems, often referred to as
the principle of mathematical induction, is the following.

Theorem 1. Let P(n) be a proposition that is defined for every natural number n.
If P(1) is true, and if P(k + 1) is true whenever P(k) is true, then P(n) is true
for all natural numbers n.

The proof of the theorem follows immediately from the postulate of finite
induction; for consider the set M of natural numbers for which the proposition
P(n) is true. By hypothesis M contains the natural number 1 and also the
natural number k + 1 whenever it contains the natural number k. Hence,
by the postulate of finite induction, M contains all the natural numbers.

Although  this  theorem  is  referred  to  as  the  principle  of  mathematical 
induction,  it  really  has  nothing  to  do  with  induction  in  the  technical  sense 
of  the  word.  To  reason  inductively,  it  will  be  recalled  from  our  discussion  in 
Section  1.2,  is  to  draw  a  general  conclusion  from  specific  data;  for  example, 
one  may  conclude  that  the  sun  will  rise  in  the  east  tomorrow  morning  because 
it  has  risen  there  so  many  mornings  in  the  past.  In  such  reasoning,  the 
conclusion  is  not  one  that  must  follow  but  is  only  one  which  probably  follows. 
Reasoning  of  this  sort  is  clearly  inadmissible  in  a  logical  development.  It  is 
often the case that induction in the usual sense is employed in order to guess
a general proposition P(n) in the first place (that is, P(n) may be conjectured
from observing several instances, like P(1), P(2), and P(3), of P(n)), but the
rigid establishment of P(n) by the so-called principle of mathematical
induction as just explained is, of course, a purely deductive procedure. The
reader is undoubtedly familiar with the principle of mathematical induction
from his work in elementary algebra and is already aware that it yields a
powerful procedure for establishing certain propositions.*

It  is  worth  pointing  out  merely  as  a  matter  of  interest  that  the  postulate 
of  finite  induction  is  a  consequence  of  the  postulates  for  the  real  number 
system  given  in  the  preceding  section.  There  we  would  define  the  natural 
numbers  as  the  elements  of  the  smallest  subset  N  of  the  real  numbers  which 
are such that a + 1 is in N if and only if a = 0 or a is in N. The statement of
the  postulate  of  finite  induction  is  then  an  immediate  consequence  of  this 
definition.  Thus,  whereas  the  statement  N10  is  taken  as  a  postulate  in  our 
development  of  the  natural  number  system,  it  occupies  the  position  of  a 
theorem  in  our  development  of  the  real  number  system. 

The  consequences  of  the  postulate  set  for  the  system  of  natural  numbers 
constitute a very interesting body of mathematics known as the theory of
numbers; this branch of mathematics has provided many recreational opportunities
for the amateur mathematician as well as a collection of conjectural
propositions that mathematical genius has been unable to prove or disprove.
The following twenty theorems can be regarded as the start of a development
of this branch of mathematics. We begin by repeating the theorem known as
the principle of mathematical induction and, following convention, we shall
abbreviate a × b by ab.


Some consequences of the Postulates N1 through N10

Theorem 1 (the principle of mathematical induction). Let P(n) be a
proposition that is defined for every natural number n. If P(1) is true, and if P(k + 1)
is true whenever P(k) is true, then P(n) is true for all natural numbers n.

Theorem 2. If a + c = b + c, then a = b.

For

    c + a = a + c     (by N1)
          = b + c     (by hypothesis)
          = c + b,    (by N1)

whence

    a = b.            (by N7)

Theorem 3. If ac = bc, then a = b.

For

    ca = ac     (by N2)
       = bc     (by hypothesis)
       = cb,    (by N2)

whence

    a = b.      (by N8)

*One of the earliest acceptable statements of the principle of mathematical induction
appears in Blaise Pascal’s Traité du triangle arithmétique, written in 1653 but not
printed until 1665. The principle reminds one of the children’s game in which toy
soldiers are lined up so that if one falls it will knock over the next one. If no soldier
is pushed over, or if some soldier is A.W.O.L., then the complete line will not fall.


Theorem 4. If e is a natural number such that ae = a for some natural
number a, then e = 1.

For

    ae = a        (by hypothesis)
       = a · 1,   (by N6)

whence

    e = 1.        (by N8)

Definition 1. If a + x = b, we say that a is less than b or b is greater
than a, and we write a < b or b > a.

Theorem 5. If a and b are any two natural numbers, then one and only one
of the following situations holds: a = b, a < b, a > b.

This  is  an  immediate  consequence  of  N9  and  Definition  1. 

Theorem  6.  If  a  <  b  and  b  <  c,  then  a  <  c. 

Since, by hypothesis, a < b and b < c, there exist natural numbers
x and y such that

    a + x = b,  b + y = c.       (by Def. 1)

Then

    a + (x + y) = (a + x) + y    (by N3)
                = b + y          (substitution)
                = c.             (substitution)

It follows that

    a < c.                       (by Def. 1)

Theorem  7.  If  a  <  b,  then  c  +  a  <  c  +  b. 

Since, by hypothesis, a < b, there exists a natural number x such that

    a + x = b.                   (by Def. 1)

Then

    (c + a) + x = c + (a + x)    (by N3)
                = c + b.         (substitution)

It follows that

    c + a < c + b.               (by Def. 1)


Theorem 8. If a < b and c < d, then a + c < b + d.

Since, by hypothesis, a < b and c < d, there exist natural numbers
x and y such that

    a + x = b,  c + y = d.                       (by Def. 1)

Then

    (a + c) + (x + y) = a + [c + (x + y)]        (by N3)
                      = a + [(x + y) + c]        (by N1)
                      = a + [x + (y + c)]        (by N3)
                      = a + [x + (c + y)]        (by N1)
                      = a + (x + d)              (substitution)
                      = (a + x) + d              (by N3)
                      = b + d.                   (substitution)

It follows that

    a + c < b + d.                               (by Def. 1)


Theorem  9.  If  a  <  b,  then  ca  <  cb. 

Since, by hypothesis, a < b, there exists a natural number x such that

    a + x = b.               (by Def. 1)

Then

    ca + cx = c(a + x)       (by N5)
            = cb.            (substitution)

It follows that

    ca < cb.                 (by Def. 1)
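
Definition 1 makes "less than" a matter of finding a natural number x with a + x = b, and this can be mimicked directly. The sketch below is an illustration and not a proof: it spot-checks Theorems 5, 6, 7, and 9 on the numbers 1 through 10, seeking the witness x in a finite window. The names less_than, check_order_theorems, N, and SMALL are introduced here.

```python
from itertools import product

N = range(1, 101)     # finite window in which the witness x is sought (an assumption)
SMALL = range(1, 11)  # the values of a, b, c that are actually tested


def less_than(a, b):
    """Definition 1: a < b means a + x = b for some natural number x."""
    return any(a + x == b for x in N)


def check_order_theorems():
    """Spot-check Theorems 5, 6, 7, and 9 on the small test range."""
    for a, b in product(SMALL, repeat=2):
        # Theorem 5: exactly one of a = b, a < b, a > b holds.
        if [a == b, less_than(a, b), less_than(b, a)].count(True) != 1:
            return False
    for a, b, c in product(SMALL, repeat=3):
        if less_than(a, b) and less_than(b, c) and not less_than(a, c):
            return False  # Theorem 6 would fail
        if less_than(a, b) and not less_than(c + a, c + b):
            return False  # Theorem 7 would fail
        if less_than(a, b) and not less_than(c * a, c * b):
            return False  # Theorem 9 would fail
    return True
```

The window of 100 is large enough for this test range, since the largest witness needed is cb − ca with c = 10, b = 10, a = 1, namely 90.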

Definition 2. If a and k are any natural numbers, we define

    a^1 = a,  a^(k+1) = a^k a.

Theorem 10. If a is a natural number, then a^n is defined for all natural
numbers n.

Let M be the set of natural numbers n for which a^n is defined. Now
M contains 1, and it contains k + 1 whenever it contains k. Therefore, by
N10, M contains all the natural numbers.

Remark.  Definition  2  is  said  to  be  a  definition  by  induction. 

Theorem 11. 1^n = 1 for all natural numbers n.

I. By Definition 2, 1^1 = 1.

II. Suppose k is a natural number such that 1^k = 1. Then

    1^(k+1) = 1^k · 1    (by Def. 2)
            = 1^k        (by N6)
            = 1.         (by supposition)

Hence the theorem follows, by the principle of mathematical induction
(Theorem 1).

Theorem 12. If a, b, n are any natural numbers, then (ab)^n = a^n b^n.

I. By Definition 2, (ab)^1 = ab = a^1 b^1.

II. Suppose k is a natural number such that (ab)^k = a^k b^k. Then

    (ab)^(k+1) = (ab)^k (ab)           (by Def. 2)
               = (a^k b^k)(ab)         (by supposition)
               = [(a^k b^k)a]b         (by N4)
               = [a^k(b^k a)]b         (by N4)
               = [a^k(a b^k)]b         (by N2)
               = [(a^k a)b^k]b         (by N4)
               = (a^(k+1) b^k)b        (by Def. 2)
               = a^(k+1)(b^k b)        (by N4)
               = a^(k+1) b^(k+1).      (by Def. 2)

Hence we have the theorem, by the principle of mathematical induction
(Theorem 1).

Theorem 13. If a, m, n are any natural numbers, then a^m a^n = a^(m+n).

I. For any fixed m,

    a^m a^1 = a^m a          (by Def. 2)
            = a^(m+1).       (by Def. 2)

II. Suppose k is a natural number such that a^m a^k = a^(m+k). Then

    a^m a^(k+1) = a^m(a^k a)         (by Def. 2)
                = (a^m a^k)a         (by N4)
                = a^(m+k) a          (by supposition)
                = a^((m+k)+1)        (by Def. 2)
                = a^(m+(k+1)).       (by N3)

Hence the theorem follows, by the principle of mathematical induction
(Theorem 1).


Theorem 14. If a, m, n are any natural numbers, then (a^m)^n = a^(mn).

I. For any fixed m,

    (a^m)^1 = a^m            (by Def. 2)
            = a^(m · 1).     (by N6)

II. Suppose k is a natural number such that (a^m)^k = a^(mk). Then

    (a^m)^(k+1) = (a^m)^k (a^m)      (by Def. 2)
                = a^(mk) a^m         (by supposition)
                = a^(mk+m)           (by Th. 13)
                = a^(m(k+1)).        (by N6 and N5)

Hence we have the theorem, by the principle of mathematical induction
(Theorem 1).
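
A definition by induction such as Definition 2 translates naturally into a recursive routine. The sketch below is illustrative only: it defines powers exactly as Definition 2 does and then spot-checks Theorems 11 through 14 on a few small numbers. Such a check is no substitute for the inductive proofs; the names power and check_exponent_laws are introduced here.

```python
from itertools import product


def power(a, n):
    """Definition 2: a^1 = a and a^(k+1) = a^k * a."""
    if n == 1:
        return a
    return power(a, n - 1) * a


def check_exponent_laws(limit=5):
    """Spot-check Theorems 11 to 14 for natural numbers 1..limit."""
    r = range(1, limit + 1)
    for n in r:
        if power(1, n) != 1:                                # Theorem 11
            return False
    for a, b, n in product(r, repeat=3):
        if power(a * b, n) != power(a, n) * power(b, n):    # Theorem 12
            return False
    for a, m, n in product(r, repeat=3):
        if power(a, m) * power(a, n) != power(a, m + n):    # Theorem 13
            return False
        if power(power(a, m), n) != power(a, m * n):        # Theorem 14
            return False
    return True
```

That the recursion always terminates is the computational face of Theorem 10: every natural number n is reached from 1 by repeatedly adding 1.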


Theorem 15. 1 ≤ n for all natural numbers n.

I. Clearly 1 ≤ 1.

II. Suppose k is a natural number such that 1 ≤ k. If 1 = k, then

    1 + 1 = 1 + k,           (substitution)

and

    1 < 1 + k.               (by Def. 1)

If 1 < k, then

    1 + 1 < 1 + k,           (by Th. 7)

whence there exists a natural number x such that

    (1 + 1) + x = 1 + k.     (by Def. 1)

Therefore

    1 + (1 + x) = 1 + k,     (by N3)

and

    1 < 1 + k.               (by Def. 1)

Thus, in any case, 1 < 1 + k, and the theorem follows by the principle of
mathematical induction (Theorem 1).

Theorem 16. If a is a natural number, then there is no natural number n
such that a < n < a + 1.

Suppose there is a natural number n such that a < n < a + 1. Then
there exist natural numbers x and y such that

    a + x = n,  n + y = a + 1.       (by Def. 1)

Then

    a + (x + y) = (a + x) + y        (by N3)
                = n + y              (substitution)
                = a + 1,             (substitution)

or

    x + y = 1.                       (by N7)

But

    x ≥ 1,  y ≥ 1.                   (by Th. 15)

Therefore

    x + y ≥ 1 + 1                    (by Th. 8)
          ≠ 1.                       (by Def. 1 and Th. 5)

But this is a contradiction, for (by Theorem 5) we cannot have both
x + y = 1 and x + y ≠ 1. Hence the theorem follows, by reductio ad absurdum.


Theorem 17. If h < k + 1, then h ≤ k.

Suppose h > k. Then we have k < h < k + 1, which contradicts
Theorem 16. The theorem thus follows, by reductio ad absurdum.


Theorem  18  (the  principle  of  the  smallest  natural  number).  In  any 
nonempty  set  M  of  natural  numbers  there  is  always  a  smallest  natural  number  (that 
is,  there  is  always  a  natural  number  in  M  which  is  less  than  any  other 
natural  number  in  M). 

Suppose M has no smallest natural number. Then 1 is not in M, for
if it were it would have to be the smallest natural number in M (by Theorem
15). Let T be the set of all natural numbers which are less than every natural
number in M. Clearly M and T have no members in common.

I. We have seen that 1 is in T.

II. Suppose k is a natural number in T. Let h be any natural number
less than k + 1. Then, by Theorem 17, h ≤ k. It follows, by the definition
of set T, that h cannot be in M. Hence, if k + 1 is in M, then k + 1 is the
smallest natural number in M (by Theorem 16). Since, by supposition, M
has no smallest natural number, it follows that k + 1 is not in M and that
all natural numbers in M must be greater than k + 1. Thus k + 1 is in T.

It  now  follows,  by  the  postulate  of  finite  induction  (Postulate  N10), 
that  T  contains  all  the  natural  numbers.  But  this  is  impossible,  since  M  is 
nonempty  and  M  and  T  have  no  members  in  common.  Hence  the  theorem 
follows  by  reductio  ad  absurdum. 

Theorem 19 (a second principle of mathematical induction). Let P(n)
be a proposition that is defined for every natural number n. If P(1) is true, and if,
for each natural number m, P(m) is true whenever P(k) is true for all natural numbers
k < m, then P(n) is true for all natural numbers n.

Let M be the set of natural numbers for which P(n) is false, and suppose
M is not empty. Then, by Theorem 18, M contains a smallest natural number
m. Note that m ≠ 1, for by hypothesis P(1) is true. Hence, for all natural
numbers k < m, P(k) is true. By hypothesis we must then have P(m) true.
But this contradicts the fact that m is in M. It follows that M must be empty,
or that P(n) is true for all natural numbers n.

Theorem 20 (a third principle of mathematical induction). Let P(n)
be a proposition that is defined for every natural number n. If P(n) is true for all
n ≤ h, where h is some fixed natural number, and if, for each natural number m,
P(m + h) is true whenever P(k) is true for all natural numbers k such that
m ≤ k < m + h, then P(n) is true for all natural numbers n.

Let M be the set of natural numbers for which P(n) is false, and suppose
M is not empty. Then, by Theorem 18, M contains a smallest natural number
s. Note that s > h, for by hypothesis P(n) is true for all n ≤ h. Since s > h,
there exists a natural number m such that s = m + h. Hence P(k) is true
for all natural numbers k < m + h. In particular, then, P(k) is true for all
natural numbers k such that m ≤ k < m + h. By hypothesis we must then
have P(m + h) true. But this contradicts the fact that m + h = s is in M.
It follows that M must be empty, or that P(n) is true for all natural numbers n.

Note that, by virtue of Theorem 16, Theorem 1 is the special case of
this theorem where h = 1.

As  already  revealed,  the  natural  number  system  occupies  a  very  basic 
position  in  the  foundations  of  mathematics,  and  its  development  from  the 
Postulates  N1  through  N10  is  not  difficult.  There  is  much  to  recommend  that 
this  postulate  set,  rather  than  a  postulate  set  for  Euclidean  geometry,  say,  be 
the  first  significant  postulate  set  that  a  student  of  mathematics  undertakes 
to  study.  This  is  because  of  the  inherent  simplicity  of  the  postulate  set  and 
the inexperienced student’s already long and wide acquaintance with elementary
properties of the natural numbers. It is recommended that, at the
start, the redundant postulates N1, N2, N4, N7, and N8 be retained. Perhaps
the reader would care to try his hand at deductive work by adding some
simple theorems to the list given above.

There  are  postulate  sets  for  the  natural  number  system  other  than  the 
one  we  have  considered,  but  in  general  these  other  sets  are  somewhat  more 
sophisticated  and  less  easy  for  a  beginner  to  develop.  Very  popular  with 
mathematicians  is  the  following  alternative  set  given  by  Peano. 

For primitive terms Peano chose natural number, successor, and 1. About
these primitive terms he postulated:

N'1: 1 is a natural number.

N'2: For each natural number x there exists exactly one natural number, called
the successor of x, which will be denoted by x'.

N'3: 1 is not the successor of any natural number.

N'4: If x' = y', then x = y.

N'5 (the postulate of finite induction): If M is a set of natural numbers
such that (1) M contains 1, (2) M contains x' whenever it contains x, then M contains
all the natural numbers.

If  we  interpret  x'  to  mean  x  +  1,  then  it  is  an  easy  matter  to  show  that 
Peano’s  postulates  are  implied  by  our  earlier  postulate  set.  With  appropriate 
definitions  of  addition  and  multiplication,  it  can  also  be  shown  that  the 
earlier  postulate  set  is  implied  by  Peano’s  set.  Thus  the  two  postulate  sets 
are  equivalent.  It  can  be  shown  that  each  set,  if  consistent,  is  categorical. 
It  is  clear  that  Peano’s  set  is  closely  allied  to  our  conception  of  the  natural 
numbers  as  an  ordered  sequence  of  elements  starting  with  some  first  element. 
Many  careful  and  detailed  developments  of  the  Peano  postulate  set  exist 
in  the  literature,  together  with  a  definitional  approach  to  the  real  number 
system from it.*
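
The text does not spell out the "appropriate definitions" of addition and multiplication in Peano's system. A customary choice, assumed here, is the pair of recursive definitions x + 1 = x', x + y' = (x + y)' and x · 1 = x, x · y' = x · y + x. The sketch below models them with ordinary Python integers standing in for the natural numbers, so that y − 1 plays the part of the number whose successor is y; the names succ, add, mul, and check_peano_arithmetic are introduced for illustration.

```python
def succ(x):
    """The successor x' of x, interpreted as x + 1."""
    return x + 1


def add(x, y):
    """Addition by recursion: x + 1 = x', x + y' = (x + y)'."""
    if y == 1:
        return succ(x)
    return succ(add(x, y - 1))


def mul(x, y):
    """Multiplication by recursion: x * 1 = x, x * y' = x * y + x."""
    if y == 1:
        return x
    return add(mul(x, y - 1), x)


def check_peano_arithmetic(limit=6):
    """Spot-check N1, N2, and N5 for the operations defined above."""
    r = range(1, limit + 1)
    for a in r:
        for b in r:
            if add(a, b) != add(b, a) or mul(a, b) != mul(b, a):
                return False
            for c in r:
                if mul(a, add(b, c)) != add(mul(a, b), mul(a, c)):
                    return False
    return True
```

Only succ is used to build up results; everything else is obtained by recursion, which is the sense in which the earlier postulate set can be recovered from Peano's by definition.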

7.4 The Integers and the Rational Numbers

As  promised,  we  shall  now  indicate  a  program  wherein  the  real  number 
system  is  obtained  from  the  natural  number  system  in  a  purely  definitional 
way,  without  the  introduction  of  any  further  postulates.  Moreover,  the  same 
procedure  will  actually  be  extended  to  the  more  general  complex  number 
system.  The  program  will  be  carried  out  in  stages.  In  this  section  we  shall 
show  how,  from  the  natural  numbers,  one  can  obtain  the  set  of  all  integers 
(positive,  zero,  and  negative)  and  then,  after  this  is  done,  the  set  of  all  rational 
numbers.  In  the  following  section  we  shall  then  obtain  the  set  of  all  real 
numbers  and,  finally,  the  set  of  all  complex  numbers.  Because  of  limited 
space,  our  treatment  will  necessarily  be  very  sketchy,  but  then  our  aim  is 
merely  to  indicate  the  procedure  rather  than  to  execute  it  in  all  its  details. 
For  a  complete  development,  beginning  with  Peano’s  postulate  set  for  the 

*See, for example, E. G. H. Landau [1] or C. C. MacDuffee [1].

natural numbers, the interested reader may consult Landau’s scholarly and
very readable Foundations of Analysis.*

We  shall  continue  the  numbering  of  definitions  and  theorems  that  was 
started  in  the  previous  section.  Each  stage  of  the  program  will  be  preceded 
by  a  brief  paragraph  explaining  the  motivation  of  the  ideas  which  follow. 
Without  this  motivation,  some  of  the  definitions  might  seem  highly  artificial. 
It  must  be  understood,  however,  that  the  discussion  of  motivation  is  quite 
informal  and  is  not  an  actual  part  of  the  demonstration;  in  fact,  in  the 
discussion  some  concepts  will  be  anticipated  even  before  they  are  introduced 
logically.  The  proofs  of  the  theorems  will,  in  large  part,  be  left  for  the  reader 
to  supply.  We  shall  assume  that  the  familiar  properties  of  the  natural  numbers 
have  been  developed  and  are  at  our  disposal. 

The  integers 

[Motivation. Any integer, be it positive, zero, or negative, can be
represented as the difference m − n of two natural numbers m and n. If
m > n, then m − n is a positive integer; if m = n, then m − n is the zero
integer; if m < n, then m − n is a negative integer. This suggests the idea
of logically introducing the integers as ordered pairs (m,n) of natural numbers,
where by (m,n) we actually have in mind the difference m − n, although
this fact does not enter into any demonstrations. With this interpretation of
an ordered pair of natural numbers we are led to say that (a,b) = (c,d) if
and only if a − b = c − d, or a + d = b + c. Similarly, since (a − b) +
(c − d) = (a + c) − (b + d), we are led to define (a,b) + (c,d) to be the
ordered pair of natural numbers (a + c, b + d), and since (a − b)(c − d) =
(ac + bd) − (ad + bc), we are led to define (a,b)(c,d) to be the ordered
pair of natural numbers (ac + bd, ad + bc).]

Definition 3. We shall denote the set of all ordered pairs (m,n) of
natural numbers m and n by I.

Definition 4. If (a,b) and (c,d) are two members of I, we say that
(a,b) = (c,d) if and only if a + d = b + c.

Remark. It is to be noted that the definition of equality of members of
I is not the ordinary concept of equality as an identity of symbols.

Theorem  21.  Equality  of  members  of  I  is  an  equivalence  relation ;  that  is , 
(1)  (a,b)  =  (a,b),  (2)  if  (a,b)  =  (c,d),  then  (c,d)  =  (a,b),  (3)  if  (a,b)  = 
(c,d)  and  (c,d)  =  (e,f),  then  (a,b)  =  (e,f). 

The reflexive and symmetric properties are easily established. To establish
the transitive property we note that, if (a,b) = (c,d) and (c,d) = (e,f),
then a + d = b + c and c + f = d + e. Consequently a + d + c + f =
b + c + d + e, or, on canceling c + d from both sides, a + f = b + e,
whence (a,b) = (e,f).


*This is E. G. H. Landau [1].

Theorem 22. For any natural number x, (x + a, x + b) = (a,b).

Definition 5. We define the sum of any two members (a,b) and (c,d)
of I, taken in this order, to be the ordered pair of natural numbers (a + c,
b + d), and we write

    (a,b) + (c,d) = (a + c, b + d).

This binary operation on members of I will be called addition.

Theorem  23.  I  is  closed  under  addition ,  which  is  commutative ,  associative , 
and  well  defined. 

The closure, commutativity, and associativity properties follow immediately
from the definition of addition and from the corresponding properties
of natural numbers. Since equality of elements in I is not an identity equality,
addition in I, if it is to be of use to us, should be a well-defined operation in I.
That is, if (a,b) = (a',b') and (c,d) = (c',d'), we should have that
(a',b') + (c',d') = (a,b) + (c,d). We show that this is so as follows:

    (a',b') + (c',d') = (a' + c', b' + d')                    (by Def. 5)
                      = (a + c + a' + c', a + c + b' + d')    (by Th. 22)
                      = (a + c + a' + c', a' + b + c' + d)    (by Def. 4)
                      = (a + c, b + d)                        (by Th. 22)
                      = (a,b) + (c,d).                        (by Def. 5)
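The well-definedness argument above can be spot-checked mechanically. The following is only a sketch, not part of the demonstration: the function names are our own, the natural numbers are assumed to be 1, 2, 3, ..., and a finite sample can illustrate the theorem but cannot prove it.

```python
from itertools import product

def eq(p, q):
    """Definition 4: (a,b) = (c,d) if and only if a + d = b + c."""
    (a, b), (c, d) = p, q
    return a + d == b + c

def add(p, q):
    """Definition 5: (a,b) + (c,d) = (a + c, b + d)."""
    (a, b), (c, d) = p, q
    return (a + c, b + d)

# Assumption: a small sample of natural numbers 1..4 stands in for all of them.
pairs = list(product(range(1, 5), repeat=2))

def addition_is_well_defined(pairs):
    """Whenever p = p2 and q = q2 (in the sense of Def. 4), check p + q = p2 + q2."""
    return all(
        eq(add(p, q), add(p2, q2))
        for p in pairs for p2 in pairs if eq(p, p2)
        for q in pairs for q2 in pairs if eq(q, q2)
    )

print(addition_is_well_defined(pairs))
```

Here (3,1) and (5,3) are different symbols for the same integer, and the check confirms that replacing one by the other never changes a sum, up to the equality of Definition 4.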

Definition 6. We define the product of any two members (a,b) and
(c,d) of I, taken in this order, to be the ordered pair of natural numbers
(ac + bd, ad + bc), and we write

    (a,b)(c,d) = (ac + bd, ad + bc).

This binary operation on members of I will be called multiplication.

Theorem 24. I is closed under multiplication, which is commutative,
associative, well defined, and distributive over addition.

We shall show that multiplication is well defined and is distributive
over addition; the other properties are easily established. Suppose, then,
that (a,b) = (a',b') and (c,d) = (c',d'). Then


    (a',b')(c',d')
        = (a'c' + b'd', a'd' + b'c')                              (by Def. 6)
        = (ac' + bd' + a'c' + b'd', ac' + bd' + a'd' + b'c')      (by Th. 22)
        = (ac' + bd' + a'c' + b'd', c'(a + b') + d'(b + a'))
        = (ac' + bd' + a'c' + b'd', c'(a' + b) + d'(b' + a))      (by Def. 4)
        = (ac' + bd' + a'c' + b'd', c'a' + c'b + d'b' + d'a)
        = (ac' + bd', c'b + d'a)                                  (by Th. 22)
        = (ac + bd + ac' + bd', ac + bd + c'b + d'a)              (by Th. 22)
        = (ac + bd + ac' + bd', a(c + d') + b(d + c'))
        = (ac + bd + ac' + bd', a(c' + d) + b(d' + c))            (by Def. 4)
        = (ac + bd + ac' + bd', ac' + ad + bd' + bc)
        = (ac + bd, ad + bc)                                      (by Th. 22)
        = (a,b)(c,d)                                              (by Def. 6)

and  multiplication  is  well  defined.  To  establish  the  distributivity  property 
we  note  that 

    (a,b)[(c,d) + (e,f)]
        = (a,b)(c + e, d + f)                                (by Def. 5)
        = (a(c + e) + b(d + f), a(d + f) + b(c + e))         (by Def. 6)
        = ((ac + bd) + (ae + bf), (ad + bc) + (af + be))
        = (ac + bd, ad + bc) + (ae + bf, af + be)            (by Def. 5)
        = (a,b)(c,d) + (a,b)(e,f).                           (by Def. 6)
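As with addition, the two properties just established for multiplication can be spot-checked on a finite sample. The sketch below uses our own function names and assumes the natural numbers 1, 2, 3 as a sample; it illustrates Theorem 24 without replacing its proof.

```python
from itertools import product

def eq(p, q):
    """Definition 4: (a,b) = (c,d) if and only if a + d = b + c."""
    (a, b), (c, d) = p, q
    return a + d == b + c

def add(p, q):
    """Definition 5: (a,b) + (c,d) = (a + c, b + d)."""
    (a, b), (c, d) = p, q
    return (a + c, b + d)

def mul(p, q):
    """Definition 6: (a,b)(c,d) = (ac + bd, ad + bc)."""
    (a, b), (c, d) = p, q
    return (a * c + b * d, a * d + b * c)

# Assumption: the sample 1..3 of natural numbers is enough for a spot check.
pairs = list(product(range(1, 4), repeat=2))

def multiplication_is_well_defined(pairs):
    """Equal factors (in the sense of Def. 4) must give equal products."""
    return all(
        eq(mul(p, q), mul(p2, q2))
        for p in pairs for p2 in pairs if eq(p, p2)
        for q in pairs for q2 in pairs if eq(q, q2)
    )

def multiplication_distributes(pairs):
    """Check p(q + r) = pq + pr for all sampled triples."""
    return all(
        eq(mul(p, add(q, r)), add(mul(p, q), mul(p, r)))
        for p in pairs for q in pairs for r in pairs
    )

print(multiplication_is_well_defined(pairs), multiplication_distributes(pairs))
```

For instance, mul((3,1), (1,4)) gives (7,13); with the informal reading of (m,n) as m - n this is 2 times -3 equal to -6, as expected.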

Definition 7. The elements of I, subject to Definitions 4, 5, 6, will be
called integers.

Definition 8. The integers (m,n), where m > n, will be called positive
integers.

Theorem  25.  Every  positive  integer  has  the  form  (x  +  a,  a). 

Definition 9. If two sets of symbols a, b, c, ... and α, β, γ, ..., where
for each set an equality, an addition, and a multiplication have been defined,
are in one-to-one correspondence, say

    a ↔ α, b ↔ β, c ↔ γ, ...,

such that (1) a = b if and only if α = β, (2) a + b ↔ α + β, (3) ab ↔ αβ,
then we say that the two sets are isomorphic relative to the operations of addition
and multiplication.

Theorem  26.  The  positive  integers  (x  +  a,  a)  and  the  natural  numbers  x 
are  isomorphic  relative  to  the  operations  of  addition  and  multiplication. 

For, under the correspondence (x + a, a) ↔ x, we have

(1) (x + a, a) = (y + b, b) if and only if x = y,

(2) (x + a, a) + (y + b, b) = (x + y + a + b, a + b), whence

    (x + a, a) + (y + b, b) ↔ x + y,

(3) (x + a, a)(y + b, b) = (xy + xb + ay + ab + ab, xb + ab +
ay + ab), whence

    (x + a, a)(y + b, b) ↔ xy.
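The three conditions of the isomorphism can be illustrated numerically. In this sketch the name embed and the default choice a = 1 are our own; any natural number a yields the same integer by Theorem 22, and the natural numbers are assumed to begin at 1.

```python
def eq(p, q):
    """Definition 4: (a,b) = (c,d) if and only if a + d = b + c."""
    (a, b), (c, d) = p, q
    return a + d == b + c

def add(p, q):
    """Definition 5."""
    (a, b), (c, d) = p, q
    return (a + c, b + d)

def mul(p, q):
    """Definition 6."""
    (a, b), (c, d) = p, q
    return (a * c + b * d, a * d + b * c)

def embed(x, a=1):
    """Send the natural number x to the positive integer (x + a, a)."""
    return (x + a, a)

def isomorphism_holds(limit=6):
    """Check conditions (1), (2), (3) of Definition 9 on a finite sample."""
    for x in range(1, limit):
        for y in range(1, limit):
            for a in (1, 2, 3):
                for b in (1, 2, 3):
                    p, q = embed(x, a), embed(y, b)
                    if eq(p, q) != (x == y):
                        return False
                    if not eq(add(p, q), embed(x + y)):
                        return False
                    if not eq(mul(p, q), embed(x * y)):
                        return False
    return True

print(isomorphism_holds())
```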

Remark.  Because  of  the  isomorphism  established  in  Theorem  26,  the 
natural  numbers  are  embedded  within  the  integers,  and  the  integers  thus 
form  an  extension  of  the  set  of  natural  numbers. 

Definition 10. Henceforth we shall use the notation x for the positive
integer (x + a, a).

Theorem 27. (b,c) = (m,m) if and only if b = c.

Theorem 28. For every integer (x,y), we have (x,y) + (m,m) = (x,y)
and (x,y)(m,m) = (m,m).

For

    (x,y) + (m,m) = (x + m, y + m)          (by Def. 5)
                  = (x,y),                  (by Th. 22)

and

    (x,y)(m,m) = (xm + ym, xm + ym)         (by Def. 6)
               = (m,m).                     (by Th. 27)

Definition 11. Henceforth we shall use the notation 0 for the integer
(m,m), and shall call this integer the zero integer.

Theorem  29.  (m,n)  +  (n,m)  =  0. 

Definition 12. We shall denote (m,n) by -(n,m).

Definition 13. The integers (m,n), where m < n, will be called negative
integers.

Theorem 30. Every negative integer has the form (a, a + x).

Theorem 31. (a, a + x) = -x.

For

    (a, a + x) = -(x + a, a)        (by Def. 12)
               = -x.                (by Def. 10)
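The roles of the zero integer and of negation can be illustrated in the same spirit. The sketch below uses our own helper names; neg follows Definition 12 (negation interchanges the two components) and is_zero follows Theorem 27.

```python
def eq(p, q):
    """Definition 4: (a,b) = (c,d) if and only if a + d = b + c."""
    (a, b), (c, d) = p, q
    return a + d == b + c

def add(p, q):
    """Definition 5."""
    (a, b), (c, d) = p, q
    return (a + c, b + d)

def mul(p, q):
    """Definition 6."""
    (a, b), (c, d) = p, q
    return (a * c + b * d, a * d + b * c)

def neg(p):
    """Definition 12: -(n,m) denotes (m,n)."""
    n, m = p
    return (m, n)

def is_zero(p):
    """Theorem 27: a pair equals (m,m) exactly when its components agree."""
    b, c = p
    return b == c

p = (2, 5)        # a negative integer, informally 2 - 5
zero = (4, 4)     # one of the many symbols for the zero integer

print(eq(add(p, zero), p))        # Theorem 28, first part
print(is_zero(mul(p, zero)))      # Theorem 28, second part
print(is_zero(add(p, neg(p))))    # Theorem 29
```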


The  familiar  properties  of  the  integers  may  now  be  developed.  Since  we 
lack  space  actually  to  do  this,  we  shall,  in  the  next  stage  of  our  program, 
freely  assume  those  properties  which  may  be  needed. 


The  rational  numbers 

[Motivation. Any rational number (positive, zero, or negative) can
be expressed as the quotient m/n of two integers m and n, with n ≠ 0; in
fact, the word rational has its origin in this fact. This suggests the idea of
logically introducing the rational numbers as ordered pairs (m,n) of integers,
with n ≠ 0, where we actually have in mind the quotient m/n. With this
interpretation of an ordered pair of integers, in which the second integer
is nonzero, we are led to say (a,b) = (c,d) if and only if a/b = c/d, or
ad = bc. Similarly, since a/b + c/d = (ad + bc)/bd, we are led to define
(a,b) + (c,d) to be the ordered pair of integers (ad + bc, bd), and since
(a/b)(c/d) = ac/bd, we are led to define (a,b)(c,d) to be the ordered pair
of integers (ac, bd).]

Definition 14. We shall denote the set of all ordered pairs (m,n) of
integers m and n, with n ≠ 0, by Q.

Definition 15. If (a,b) and (c,d) are two members of Q, we say that
(a,b) = (c,d) if and only if ad = bc.

Remark. It is to be noted that the definition of equality of members
of Q is not the ordinary concept of equality as an identity of symbols.

Theorem 32. Equality of members of Q is an equivalence relation; that is,
(1) (a,b) = (a,b), (2) if (a,b) = (c,d), then (c,d) = (a,b), (3) if (a,b) =
(c,d) and (c,d) = (e,f), then (a,b) = (e,f).

Definition 16. We define the sum of any two members (a,b) and (c,d)
of Q, taken in this order, to be the ordered pair of integers (ad + bc, bd),
and we write

    (a,b) + (c,d) = (ad + bc, bd).

This binary operation on members of Q will be called addition.

Theorem  33.  Q  is  closed  under  addition,  which  is  commutative,  associative, 
and  well  defined. 

Definition 17. We define the product of any two members (a,b) and
(c,d) of Q, taken in this order, to be the ordered pair of integers (ac, bd),
and we write

    (a,b)(c,d) = (ac, bd).

This binary operation on members of Q will be called multiplication.

Theorem 34. Q is closed under multiplication, which is commutative,
associative, well defined, and distributive over addition.

Definition  18.  The  elements  of  Q,  subject  to  Definitions  15,  16,  17, 
will  be  called  rational  numbers. 
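The definitions of this stage can be tried out on pairs of ordinary machine integers. The following is a sketch only: the names q_eq, q_add, and q_mul are our own, Python integers stand in for the integers constructed above, and a finite sample illustrates Theorems 33 and 34 without proving them.

```python
from itertools import product

def q_eq(p, q):
    """Definition 15: (a,b) = (c,d) if and only if ad = bc."""
    (a, b), (c, d) = p, q
    return a * d == b * c

def q_add(p, q):
    """Definition 16: (a,b) + (c,d) = (ad + bc, bd)."""
    (a, b), (c, d) = p, q
    return (a * d + b * c, b * d)

def q_mul(p, q):
    """Definition 17: (a,b)(c,d) = (ac, bd)."""
    (a, b), (c, d) = p, q
    return (a * c, b * d)

# Assumption: a small sample of pairs with nonzero second component.
sample = [(m, n) for m, n in product(range(-2, 3), repeat=2) if n != 0]

def operations_well_defined(sample):
    """Equal operands (in the sense of Def. 15) must give equal sums and products."""
    return all(
        q_eq(q_add(p, q), q_add(p2, q2)) and q_eq(q_mul(p, q), q_mul(p2, q2))
        for p in sample for p2 in sample if q_eq(p, p2)
        for q in sample for q2 in sample if q_eq(q, q2)
    )

print(q_add((1, 2), (1, 3)), q_mul((2, 3), (3, 4)), operations_well_defined(sample))
```

Note that q_mul((2,3), (3,4)) returns the pair (6,12), which is a different symbol from (1,2) but equal to it under Definition 15; no reduction to lowest terms is needed.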

Theorem 35. The rational numbers (x,1) and the integers x are isomorphic
relative to the operations of addition and multiplication.

Remark.  Because  of  the  isomorphism  established  in  Theorem  35,  the 
integers  are  embedded  within  the  rational  numbers,  and  the  rational  numbers 
thus  form  an  extension  of  the  set  of  integers. 

Definition 19. Henceforth we shall use the notation x for the rational
number (x,1) and shall call rational numbers of the form (a,b), where a ≠
mb, fractions.

The  familiar  properties  of  the  rational  numbers  may  now  be  developed. 
Again,  since  we  lack  space  actually  to  deduce  these  properties,  we  shall, 
in  the  next  stage  of  our  program,  freely  assume  those  properties  which  may 
be  needed.  In  particular,  we  assume  the  order  properties  connecting  the 
rational  numbers. 

7.5 The Real Numbers and the Complex Numbers

In the first part of this section we continue our program by building up
the real number system from the rational numbers. This, which is undoubtedly
the most difficult of the stages in building up the complex number
system from the natural numbers, may be accomplished in several ingenious
ways. We select a procedure due to Richard Dedekind, a procedure which
ties in with some of our earlier work and with the algebraic methods that we
have so far been employing in the step-by-step extension of the natural
number system. A second procedure, devised by Georg Cantor and based
upon the concept of a regular sequence of rational numbers, can be found
in many textbooks on the theory of functions of real variables.* A third
method, based upon the decimal representation of real numbers, is favored
by some mathematicians,† while a fourth method, involving sequences of
nested intervals, is preferred by others.‡

The  real  numbers 

The  following  is  only  an  outline  of  the  present  stage  of  the  program. 
For  proofs  of  the  theorems  cited  the  reader  may,  for  example,  consult 
Goffman. 

[Motivation.  Dedekind  was  led  to  his  procedure  for  introducing  the 
real  numbers  by  the  fact  that  he  found  it  difficult  in  his  lectures  to  give  a 
clear  definition  of  continuity,  a  property  which  is  possessed  by  the  ordered 
set  of  real  numbers  but  which  is  not  possessed  by  the  ordered  set  of  rational 
numbers.  After  considerable  cogitation  on  the  difficulty,  Dedekind  perceived 
that  the  essence  of  the  continuity  of  a  straight  line  lies  in  the  property  that 
if  all  the  points  of  the  line  are  divided  into  two  classes,  such  that  every 
point  in  the  first  class  lies  to  the  left  of  every  point  in  the  second  class,  then 
there  exists  one  and  only  one  point  of  the  line  which  produces  this  severance 
of  the  line  into  the  two  classes.  Arithmetizing  this  idea  led  to  the  consideration 
of  cuts  in  the  ordered  set  of  rational  numbers,  and  to  a  definition  of  real 
numbers  that  guarantees  closure  of  the  real  number  set  under  the  operation 
of forming cuts within it, thus assuring the continuity of this set. Dedekind
was one of the great nineteenth-century pioneers in the logical and
philosophical study of the foundations of analysis, and his two essays, Stetigkeit
und irrationale Zahlen (Continuity and Irrational Numbers) of 1872 and
Was sind und was sollen die Zahlen? (What Are and What Should be Numbers?)
of 1887, profoundly influenced subsequent studies in the foundations of
mathematics.]

Definition  20.  A  Dedekind  cut  (A|B)  in  the  set  of  rational  numbers  is  a 
partition,  or  separation,  of  these  numbers  into  two  classes  A  and  B  such  that 
(1)  every  rational  number  is  either  in  A  or  in  B ,  (2)  A  and  B  each  contain 
at  least  one  rational  number,  (3)  every  element  of  A  is  less  than  every 
element  of  B. 

*Cf. J. Pierpont [1], Vol. I, Chap. 2, or C. Goffman, Chap. 3.

†Cf. J. F. Ritt or H. Levi.

‡For an elementary exposition of this method, see R. Courant and H. Robbins,
Chap. 2.

Types of Dedekind Cuts. There are three possible types of Dedekind
cuts (A|B) in the set of rational numbers, for we may have the situations
where (a) A has a largest element a and B has no smallest element, (b) A has
no largest element and B has a smallest element b, (c) A has no largest element
and B has no smallest element.


A  seeming  fourth  possibility,  where  A  has  a  largest  element  a  and  B 
has  a  smallest  element  b,  cannot  occur,  for  between  any  two  distinct  rational 
numbers  a  and  b  there  is  always  a  rational  number  c,  and  this  number  c 
could  not  lie  in  either  A  or  B ,  which  contradicts  the  formation  of  the  sets 
A  and  B. 

An example of a Dedekind cut of type (a) is the cut (A|B) in which A
consists of all the rational numbers less than or equal to 2, while B contains
all the remaining rational numbers. Here A has a largest element, namely 2,
and B has no smallest element.

An example of a Dedekind cut of type (b) is the cut (A|B) in which A
consists of all the rational numbers less than 2, while B contains all the
remaining rational numbers. Here A has no largest element, but B has a
smallest element, namely 2.

An example of a Dedekind cut of type (c) is the cut (A|B) in which A
consists of all the nonpositive rational numbers and those positive rational
numbers whose squares are less than 2, while B contains all the remaining
rational numbers. Here A has no largest element and B has no smallest
element.
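The claim made for the cut of type (c) can be made concrete with exact rational arithmetic. The sketch below is our own illustration and is not taken from the text: the function nudge uses the familiar map q to (2q + 2)/(q + 2), which moves a positive rational toward the gap in the cut without crossing it, so that every positive member of A has a larger member of A and every member of B has a smaller member of B.

```python
from fractions import Fraction

def in_A(q):
    """Lower class A: nonpositive rationals, and positive rationals with square < 2."""
    q = Fraction(q)
    return q <= 0 or q * q < 2

def nudge(q):
    """For positive q, return (2q + 2)/(q + 2).

    If q*q < 2 the result is larger than q and its square is still < 2;
    if q*q > 2 the result is smaller than q and its square is still > 2.
    """
    q = Fraction(q)
    return (2 * q + 2) / (q + 2)

q = Fraction(7, 5)          # in A, since 49/25 < 2
r = Fraction(3, 2)          # in B, since 9/4 > 2
print(nudge(q), in_A(nudge(q)), nudge(q) > q)
print(nudge(r), in_A(nudge(r)), nudge(r) < r)
```

Since the step can be repeated indefinitely from either side, A has no largest element and B has no smallest element, which is what places this cut in type (c).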

Definition  21.  A  Dedekind  cut  of  the  rational  numbers  which  is  of 
type  (a)  or  type  (b)  is  called  a  rational  cut.  To  each  rational  number  r  there 
correspond  two  cuts,  one  of  type  (a)  and  one  of  type  (b).  In  order  to  have 
a  one-to-one  correspondence  between  the  rational  cuts  and  the  rational 
numbers,  we  shall  consider  only  rational  cuts  of  type  (b). 

Definition 22. We shall denote the set of all Dedekind cuts (A|B) in
the set of rational numbers which are of type (b) or type (c) by R.

Definition 23. If (A|B) and (C|D) are members of R, we say that
(A|B) = (C|D) if and only if sets A and C are identical.

Remark.  Note  that,  by  definition,  the  equality  of  members  of  R  is  an 
identity  equality. 

Theorem 36. Let (A1|B1), (A2|B2) be two members of R. Let A be the set
of all rational numbers of the form x + y, where x is in A1 and y is in A2, and let
B be the set of all remaining rational numbers. Then (A|B) is a member of R.

Definition 24. We define the sum of any two members (A1|B1), (A2|B2)
of R to be the member (A|B) of Theorem 36. We write

    (A1|B1) + (A2|B2) = (A|B),

and refer to this binary operation on members of R as addition.

Theorem 37. Addition of members of R is commutative and associative.

Theorem 38. Let (A|B) be a member of R and let (M|P) denote the member
of R in which M is the set of all negative rational numbers and P is the set of all
the remaining rational numbers. Then there exists a unique member of R, which we
shall denote by (A'|B'), such that (A|B) + (A'|B') = (M|P).

Remark. The member (M|P) plays the role of zero, or the additive
identity, and (A'|B') is then the additive inverse of (A|B).

Definition 25. Member (A|B) of R is said to be positive if A contains
positive rational numbers; (A|B) is said to be negative if B contains negative
rational numbers.

Theorem 39. If (A|B) is positive, then (A'|B') is negative; if (A|B) is
negative, then (A'|B') is positive.

Theorem 40. Let (A1|B1), (A2|B2) be two positive members of R. Let A be
the set of all nonpositive rational numbers and those positive rational numbers of the
form xy, where x > 0 is in A1 and y > 0 is in A2. Let B be the set of all remaining
rational numbers. Then (A|B) is a member of R.

Definition 26. Let (A1|B1), (A2|B2) be any two members of R. If
(A1|B1) and (A2|B2) are both positive, we define their product, (A1|B1)(A2|B2),
to be the member (A|B) of Theorem 40. If (A1|B1) and (A2|B2) are both
negative, we define their product to be (A1'|B1')(A2'|B2'). If (A1|B1) is
positive and (A2|B2) is negative, we define their product to be (C'|D'), where
(C|D) = (A1|B1)(A2'|B2'). If (A1|B1) is negative and (A2|B2) is positive, we
define their product to be (C'|D'), where (C|D) = (A1'|B1')(A2|B2). If either
(A1|B1) or (A2|B2) is the member (M|P), we define their product to be
the member (M|P). This binary operation on members of R will be called
multiplication.


Theorem  41.  Multiplication  of  members  of  R  is  commutative ,  associative ,  and 
distributive  over  addition. 


Definition  27.  The  members  of  R ,  subject  to  Definitions  23,  24,  26, 
will  be  called  real  numbers. 

Theorem  42.  The  rational  cuts  (A|B)  of  type  (b),  in  which  r  represents  the 
smallest  element  of  B,  and  the  rational  numbers  r,  are  isomorphic  relative  to  the 
operations  of  addition  and  multiplication. 

Remark.  Because  of  the  isomorphism  of  Theorem  42,  the  rational 
numbers  are  embedded  within  the  real  numbers,  and  the  real  numbers  thus 
form  an  extension  of  the  set  of  rational  numbers. 
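For rational cuts the additive half of this isomorphism can be made tangible. The sketch below is our own illustration, with hypothetical helper names: the rational cut of type (b) belonging to r has lower class {q : q < r}, and split exhibits every rational q < r + s as a sum x + y with x < r and y < s. Since conversely x < r and y < s give x + y < r + s, the lower class of the sum of the cuts for r and s, in the sense of Theorem 36, is exactly the lower class of the cut for r + s.

```python
from fractions import Fraction

def lower_class(r):
    """Membership test for A = {q : q < r}, the lower class of the type (b) cut for r."""
    r = Fraction(r)
    return lambda q: Fraction(q) < r

def split(q, r, s):
    """For q < r + s, return (x, y) with x < r, y < s, and x + y = q."""
    q, r, s = Fraction(q), Fraction(r), Fraction(s)
    if not q < r + s:
        raise ValueError("q must be less than r + s")
    gap = (r + s - q) / 2     # half of the room left below r + s
    return (r - gap, s - gap)

r, s = Fraction(1, 2), Fraction(1, 3)
x, y = split(Fraction(3, 4), r, s)
print(x, y, x + y, lower_class(r)(x), lower_class(s)(y))
```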

Definition 28. The members of R which are rational cuts [those of
type (b)] are known as rational real numbers and those members of R which
are not rational cuts [those of type (c)] are known as irrational real numbers.

All the postulates of a complete ordered field, as listed in Section 7.2,
can now be shown to be satisfied if we interpret the elements of the field as
the Dedekind cuts (A|B) of R, the binary operations ⊕ and ⊗ as addition
and multiplication of members of R, z as the rational cut (M|P) corresponding
to 0, a as (A|B) and the additive inverse of a as (A'|B'), u as the rational
cut corresponding to 1, and the positive and negative elements of the field as
the positive and negative members of R. It follows, then, that the real numbers,
defined as members of R, constitute a complete ordered field, and hence,
because of the categoricalness of the postulate set for a complete ordered
field, are, to within an isomorphism, the same as the real numbers considered
in Section 7.2.

Since  we  were  able  to  extend  the  set  of  rational  numbers  to  the  set  of 
real  numbers  by  making  Dedekind  cuts  in  the  set  of  rational  numbers,  one 
might  wonder  if  we  can  similarly  extend  the  set  of  real  numbers  by  making 
Dedekind  cuts  in  this  set.  In  view  of  Dedekind’s  theorem  of  continuity  (see 
Section  7.2),  the  answer  to  this  question  is  no ;  that  is,  no  further  numbers 
can  be  created  by  the  method  of  Dedekind  cuts. 

The  complex  numbers 

The  last  stage  in  our  program  involves  the  building  of  the  complex 
number  system  upon  the  real  number  system  by  the  introduction  of  suitable 
definitions. 

[Motivation. We have already, in Section 5.2, given some consideration
to Hamilton's procedure of defining the complex numbers in terms of ordered
pairs of real numbers. For the sake of completeness, we record this procedure
once more. Now any complex number is customarily written in the form
m + in, where m and n are real numbers and i is the imaginary unit. This
suggests the idea of logically introducing the complex numbers as ordered
pairs [m,n] of real numbers, where by [m,n] we have in mind the representation
m + in. With this interpretation of an ordered pair of real numbers
we are led to say [a,b] = [c,d] if and only if a + ib = c + id, or if and only if
a = c and b = d. Similarly, since (a + ib) + (c + id) = (a + c) + i(b + d),
we are led to define [a,b] + [c,d] to be the ordered pair of real numbers
[a + c, b + d], and since (a + ib)(c + id) = (ac - bd) + i(ad + bc), we
are led to define [a,b][c,d] to be the ordered pair of real numbers [ac - bd,
ad + bc].]

Definition  29.  We  shall  denote  the  set  of  all  ordered  pairs  [m,n]  of 
real  numbers  by  C. 

Definition 30. If [a,b] and [c,d] are two members of C, we say that
[a,b] = [c,d] if and only if a = c and b = d.

Remark. Note that, by definition, the equality of members of C is an
identity equality.

Definition 31. We define the sum of any two members [a,b] and
[c,d] of C, taken in this order, to be the ordered pair of real numbers
[a + c, b + d], and we write

    [a,b] + [c,d] = [a + c, b + d].

This  binary  operation  on  members  of  C  will  be  called  addition. 

Theorem  43.  C  is  closed  under  addition,  which  is  commutative  and  associative. 

Definition 32. We define the product of any two members [a,b] and
[c,d] of C, taken in this order, to be the ordered pair of real numbers
[ac - bd, ad + bc], and we write

    [a,b][c,d] = [ac - bd, ad + bc].

This  binary  operation  on  members  of  C  will  be  called  multiplication. 

Theorem  44.  C  is  closed  under  multiplication,  which  is  commutative,  associative, 
and  distributive  over  addition. 

Definition  33.  The  elements  of  C,  subject  to  Definitions  30,  31,  32, 
will  be  called  complex  numbers. 

Theorem  45.  The  complex  numbers  [a,0]  and  the  real  numbers  a  are  isomorphic 
relative  to  the  operations  of  addition  and  multiplication. 

Remark.  Because  of  the  isomorphism  of  Theorem  45,  the  real  numbers 
are  embedded  within  the  complex  numbers,  and  the  complex  numbers  thus 
form  an  extension  of  the  set  of  real  numbers. 

Definition 34. Henceforth we shall use the notation a for the complex
number [a,0].

Definition 35. We shall denote the complex number [0,1] by i.

Theorem 46. i^2 = -1.

Theorem 47. [0,b] = ib.

Definition 36. Complex numbers of the form [0,b] shall be called pure
imaginary numbers, and those of the form [a,b], where a ≠ 0 and b ≠ 0,
shall be called mixed imaginary numbers.

Theorem 48. [a,b] = a + ib.

Theorem 49. When we use the notation a + ib for [a,b], we may carry
out addition and multiplication as with real polynomials in i, provided we replace
i^2, wherever it occurs, by -1.
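Theorems 46 through 48 amount to short computations with Definitions 31 and 32, and they can be replayed on sample values. In this sketch the names c_add and c_mul are our own, and small Python integers stand in for real numbers so that all arithmetic is exact.

```python
def c_add(p, q):
    """Definition 31: [a,b] + [c,d] = [a + c, b + d]."""
    (a, b), (c, d) = p, q
    return (a + c, b + d)

def c_mul(p, q):
    """Definition 32: [a,b][c,d] = [ac - bd, ad + bc]."""
    (a, b), (c, d) = p, q
    return (a * c - b * d, a * d + b * c)

def real(a):
    """Definition 34: the real number a is identified with [a,0]."""
    return (a, 0)

i = (0, 1)                                       # Definition 35

print(c_mul(i, i))                               # Theorem 46: equals [-1,0]
print(c_mul(i, real(5)))                         # Theorem 47: equals [0,5]
print(c_add(real(3), c_mul(i, real(5))))         # Theorem 48: equals [3,5]

# Cross-check against Python's built-in complex type on one product.
p, q = (3, 5), (2, -4)
print(complex(*c_mul(p, q)) == complex(*p) * complex(*q))
```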

With this we bring to a close our sketch of the program by which the
real and the complex number systems are obtained, without the need of any
further postulates, from the basic system of natural numbers. The
accomplishment of this program is an amazing achievement. Since we see that
the consistency of the great bulk of mathematics can be made to rest upon
that of the real number system, it is now apparent that the consistency of
the great bulk of mathematics actually can be made to rest upon that of the
much simpler system of natural numbers. In some sense, then, the ancient
Pythagorean belief that everything depends upon the whole numbers is
justified, and we see meaning in Leopold Kronecker's often quoted remark,
"Die ganzen Zahlen hat Gott gemacht, alles andere ist Menschenwerk."*

The program which we followed in reaching the complex number
system from the natural number system can be represented by the scheme

    N → I → Q → R → C,

where N represents the system of natural numbers, I the system of integers,
Q the system of rational numbers, R the system of real numbers, and C
the system of complex numbers. Various other developments are possible,
of which a commonly followed program is†

    N → Q(+) → R(+) → R → C,

where Q(+) represents the system of positive rational numbers and R(+)
the system of positive real numbers. It is interesting that this alternative
development  reflects  the  historical  growth  of  our  number  system.  For 
mankind  first  employed  the  whole,  or  natural,  numbers  for  counting,  then  the 
positive  fractions  for  purposes  of  measurement;  next  he  introduced  positive 
real  numbers  to  cover  incommensurable  situations  (like  the  side  and  diagonal 
of  a  square),  then  acknowledged  negative  numbers,  and,  finally,  accepted 
imaginary  numbers.  On  the  other  hand,  the  development  of  the  program 
as  we  pursued  it  reflects  the  successive  algebraic  need  for  new  numbers. 
Thus, if a and b are natural numbers, the equation x + a = b will always
have a solution only if we extend the number system to include all integers.
Again, if a ≠ 0 and b are integers, the equation ax = b will always have a
solution only if we extend the number system to include all rational numbers.
These numbers, though, are not sufficient to yield a solution to the equation
x^2 = 2, and so the real numbers are introduced. Finally, the equation
x^2 = -2 has no solution until the complex numbers are introduced. But
here we reach an end, for it has been shown that any polynomial equation
having coefficients in the complex number field has a solution within this
field; this is the famous so-called fundamental theorem of (classical) algebra.

*"God made the whole numbers, all the rest is the work of man."

†This is the program pursued in E. G. H. Landau [1].

‡Such a postulate set for the complex number system may be found in E. V.
Huntington [10].

The complex number system need not be obtained by definition from
the real number system; it, too, can be given by an appropriate categorical
set of postulates. We shall not list such a set of postulates here,‡ however,
but shall conclude with the following schema, which exhibits a classification
of the various kinds of numbers discussed in this chapter.


complex nos.
    real nos.
        rational nos.
            integers
                natural numbers
                zero
                negative integers
            fractions
        irrational nos.
    imaginary nos.
        pure imaginary nos.
        mixed imaginary nos.


PROBLEMS

7.1.1 (a) Under what conditions on the real number x is

    √(1 - sin^2 x) = cos x?

(b)  Under  what  conditions  does  the  equation  ax  =  b,  where  a  and  b 
are  real  numbers,  have  exactly  one  real  solution?  no  real  solution?  infinitely 
many  real  solutions? 

7.1.2 (a) If b is a real number, we define b^5 to be an abbreviation for
(b)(b)(b)(b)(b), and, more generally, if n is any positive integer, we define
b^n to be an abbreviation for (b)(b)...(b), where there are n factors in the
product. In the symbol b^n, b is referred to as the base and n as the exponent.
Show that the following so-called laws of exponents, in which a and b represent
real numbers and m and n represent positive integers, are merely commonsense
observations based on the above definition:

1. (b^m)(b^n) = b^(m+n).

2. b^m / b^n = b^(m-n) if m > n and b ≠ 0.

3. (b^m)^n = b^(mn).

4. (ab)^n = a^n b^n.

(b) The extension of the notion of exponents to include all rational
numbers as exponents is accomplished by definition and is motivated by a
desire to preserve the above laws of exponents. (1) Show that if we wish
to preserve law 2 for the situation where m = n, we are forced, for b ≠ 0,
to define b^0 = 1. (2) Show that if we wish to preserve law 2 for the situation
where n = m + 1, we are forced, for b ≠ 0, to define b^(-1) to mean 1/b.
(3) Show that if we wish to preserve law 3, we are forced to define b^(1/n),
where n is a positive integer, to mean an nth root of b.

(c) In part (b) we have seen how one would try to formulate a set of
definitions which might permit the universal application of the four
exponential laws to numbers written in exponential form, where the bases of
these numbers are real and the exponents are rational. Each definition was
motivated by the effort to preserve some one of the four laws. It could well
be that it is impossible in this way to obtain suitable definitions that would
satisfy all the exponential laws. Show that this is actually the case by (1)
evaluating $[(-3)^2]^{1/2}$ in two ways, (2) evaluating $[(-4)(-4)]^{1/2}$ in two
ways. Many students of elementary algebra are under the delusion that the
four laws of exponents hold universally for exponential numbers having real
bases and rational exponents; indeed, many textbooks of elementary algebra
make this erroneous statement. The examples above show that this is not so.
It can be shown, however, that the four exponential laws do hold for real bases
and rational exponents if we exclude the situations where the base is negative and the
exponent is a reduced fraction with an even denominator. Without an understanding
of the limitations involved in the use of the exponential laws for rational
exponents, a student can become involved in paradoxical results.
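
As a numerical illustration of the caution above, the following Python sketch (not part of the original text; the variable names are illustrative) evaluates $[(-3)^2]^{1/2}$ first as written and then by a formal application of law 3, which multiplies the exponents before anything else is done. It assumes that the exponent 1/2 denotes the principal (nonnegative) square root.

```python
# Evaluate [(-3)^2]^(1/2) in two ways.
base = -3

# Way 1: square first, then take the principal square root.
inner_first = (base ** 2) ** 0.5      # 9 ** 0.5

# Way 2: apply law 3 formally, multiplying the exponents 2 * (1/2) = 1.
law3_first = base ** (2 * 0.5)        # (-3) ** 1.0

print(inner_first, law3_first)
```

The two results differ in sign, which is the kind of disagreement the problem asks the reader to exhibit and explain.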

7.1.3  Explain  the  following  paradoxes  involving  square  root  radicals: 

(a) Since $\sqrt{a} \sqrt{b} = \sqrt{ab}$, we have

$\sqrt{-1} \sqrt{-1} = \sqrt{(-1)(-1)} = \sqrt{1} = 1$.

But, by definition, $\sqrt{-1} \sqrt{-1} = -1$. Hence $-1 = +1$.

(b) We have, successively,

$\sqrt{-1} = \sqrt{-1}$,

$\frac{\sqrt{1}}{\sqrt{-1}} = \frac{\sqrt{-1}}{\sqrt{1}}$,

$\sqrt{1} \sqrt{1} = \sqrt{-1} \sqrt{-1}$,

$1 = -1$.

(c) Consider the following identity, which holds for all values of $x$ and $y$:

$\sqrt{x - y} = i \sqrt{y - x}$.

Setting $x = a$, $y = b$, where $a \neq b$, we find

$\sqrt{a - b} = i \sqrt{b - a}$.

Now setting $x = b$, $y = a$, we find

$\sqrt{b - a} = i \sqrt{a - b}$.

Multiplying the last two equations, member by member, we get

$\sqrt{a - b} \sqrt{b - a} = i^2 \sqrt{b - a} \sqrt{a - b}$.

Dividing both sides by $\sqrt{a - b} \sqrt{b - a}$, we finally get

$1 = i^2$, or $1 = -1$.

7.1.4  In  elementary  algebra  one  often  learns  the  “axioms”: 

A1: If equals are added to equals, the results are equal.

A2: If equals are subtracted from equals, the results are equal.

A3: If equals are multiplied by equals, the results are equal.

A4: If equals are divided by equals, the results are equal.

A5: Equal roots of equals are equal.

Explain  the  following  paradoxes,  which  use  only  these  axioms: 

(a)  Let  us  solve  the  simple  equation 

$x - 1 = 2$

in the following roundabout way. Multiplying both sides by $x - 5$, we get,
by A3,

$x^2 - 6x + 5 = 2x - 10$.

Now subtracting $x - 7$ from both sides, we get, by A2,

$x^2 - 7x + 12 = x - 3$.

Next, dividing both sides by $x - 3$, we get, by A4,

$x - 4 = 1$.

Finally, adding 4 to both sides, we get, by A1,

$x = 5$.

But  this  answer  is  not  correct,  for  the  value  x  =  5  does  not  satisfy  the  original 
equation. 

(b)  Let  a  and  b  be  any  two  numbers  and  denote  their  sum  by  2c.  Then 

a  +  b  =  2c. 

Multiplying  both  sides  by  a  —  b,  we  get,  by  A3, 

$a^2 - b^2 = 2ac - 2bc$.

Adding $b^2 + c^2 - 2ac$ to both sides, we get, by A1,

$a^2 - 2ac + c^2 = b^2 - 2bc + c^2$

or

$(a - c)^2 = (b - c)^2$.

Taking the square root of each side, we have, by A5,

$a - c = b - c$.

Adding $c$ to both sides, we get, by A1,

$a = b$.

But $a$ and $b$ were chosen as any two numbers. Therefore any two numbers
are equal to each other.

7.1.5 Find the sum of the roots of the equation

$\frac{x + 3}{x^2 - 1} + \frac{x - 3}{x^2 - x} + \frac{x + 2}{x^2 + x} = 0$.


7.1.6  Most  students  of  elementary  algebra  will  agree  to  the  following 
theorem:  “If  two  fractions  are  equal  and  have  equal  numerators,  then  they 
also  have  equal  denominators.”  Now  consider  the  following  problem.  We 
wish  to  solve  the  equation 


$\frac{x + 5}{x - 7} - 5 = \frac{4x - 40}{13 - x}$.

Combining the terms on the left side, we find

$\frac{(x + 5) - 5(x - 7)}{x - 7} = \frac{4x - 40}{13 - x}$

or

$\frac{4x - 40}{7 - x} = \frac{4x - 40}{13 - x}$.


By  the  above  theorem,  it  follows  that  7  —  x  =  13  —  x,  or,  upon  adding  x 
to  both  sides,  that  7  =  13.  What  is  wrong? 

7.1.7 Explain the following paradox: Let $a/b$ and $c/d$ be any two equal
fractions. Then

$\frac{a}{b} - 1 = \frac{c}{d} - 1$ and $\frac{a}{b} + 1 = \frac{c}{d} + 1$.

That is

$\frac{a - b}{b} = \frac{c - d}{d}$ and $\frac{a + b}{b} = \frac{c + d}{d}$.

It follows that

$\frac{a - b}{a + b} = \frac{c - d}{c + d}$,

$(a - b)(c + d) = (a + b)(c - d)$,

$ac - bc + ad - bd = ac + bc - ad - bd$,

$ad - bc = -(ad - bc)$,

$1 = -1$.

7.1.8  Explain  the  following  paradox: 

Certainly 

3  >  2. 

Multiplying  both  sides  by  log  (1/2),  we  find 

$3 \log (1/2) > 2 \log (1/2)$

or

$\log (1/2)^3 > \log (1/2)^2$

whence

$(1/2)^3 > (1/2)^2$ or $1/8 > 1/4$.

7.1.9  Explain  the  following  paradox: 

Clearly $(-1)^2 = (+1)^2$. Taking the logarithm of each side, we have
$\log (-1)^2 = \log (1)^2$. Therefore $2 \log (-1) = 2 \log 1$, or $-1 = 1$.

7.1.10 (for students who have studied calculus) Resolve the following
paradoxes:

(a)  By  standard  procedure  we  find, 


But the function $y = 1/x^2$ is never negative; hence the above “evaluation”
cannot  be  correct. 

(b) Let $e$ denote the eccentricity of the ellipse $x^2/a^2 + y^2/b^2 = 1$. It is
well known that the length $r$ of the radius vector drawn from the left-hand
focus of the ellipse to any point $P(x,y)$ on the curve is given by $r = a + ex$.
Now $dr/dx = e$. Since there are no values of $x$ for which $dr/dx$ vanishes, it
follows that $r$ has no maximum or minimum. But the only closed curve for
which the radius vector has no maximum or minimum is a circle. It follows
that every ellipse is a circle.

(c) Consider the isosceles triangle $ABC$ of Figure 31, in which base
$AB = 12$ and altitude $CD = 3$.

[FIG. 31]

Surely there is a point $P$ on $CD$ such that

$S = PC + PA + PB$

is a minimum. Let us try to locate this point $P$. Denote $DP$ by $x$. Then
$PC = 3 - x$ and $PA = PB = (x^2 + 36)^{1/2}$. Therefore

$S = 3 - x + 2(x^2 + 36)^{1/2}$,

and

$\frac{dS}{dx} = -1 + 2x(x^2 + 36)^{-1/2}$.

Setting $dS/dx = 0$, we find $x = 2\sqrt{3} > 3$, and $P$ lies outside the triangle on
$DC$ produced. Hence there is no point on the segment $CD$ for which $S$ is
a minimum.
(d) Consider the integral

$I = \int \sin x \cos x \, dx$.

Then we have

$I = \int \sin x \, (\cos x \, dx) = \int \sin x \, d(\sin x) = \frac{\sin^2 x}{2}$.

Also

$I = \int \cos x \, (\sin x \, dx) = -\int \cos x \, d(\cos x) = -\frac{\cos^2 x}{2}$.

Therefore

$\sin^2 x = -\cos^2 x$,

or

$\sin^2 x + \cos^2 x = 0$.

But, for any $x$,

$\sin^2 x + \cos^2 x = 1$.

(e) Since

$\int \frac{dx}{x} = \int \frac{-dx}{-x}$,

we have $\log x = \log (-x)$ or $x = -x$, whence $1 = -1$.
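(f) Let $S$ denote the sum of the convergent series

$\frac{1}{(1)(3)} + \frac{1}{(3)(5)} + \frac{1}{(5)(7)} + \cdots$.

Then

$S = \left(\frac{1}{1} - \frac{2}{3}\right) + \left(\frac{2}{3} - \frac{3}{5}\right) + \left(\frac{3}{5} - \frac{4}{7}\right) + \cdots$

$= 1 - \frac{2}{3} + \frac{2}{3} - \frac{3}{5} + \frac{3}{5} - \frac{4}{7} + \cdots$

$= 1$,

since all terms after the first cancel out. Again

$S = \frac{(1/1) - (1/3)}{2} + \frac{(1/3) - (1/5)}{2} + \frac{(1/5) - (1/7)}{2} + \cdots$

$= \frac{1}{2} - \frac{1}{6} + \frac{1}{6} - \frac{1}{10} + \frac{1}{10} - \frac{1}{14} + \cdots = \frac{1}{2}$,

since all terms after the first cancel out. It follows that $1 = 1/2$.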

(g)  Explain  the  paradoxical  results  concerning  the  series  S  in  Section 
7.1. 

7.2.1  Give  a  definition  of  a  lower  bound  and  of  a  greatest  lower  bound  of 
a  collection  M  of  real  numbers. 

7.2.2  Prove  that  any  collection  of  real  numbers  can  have  at  most  one  least 
upper  bound. 

7.2.3 Give an example of a collection $M$ of real numbers which has

(a)  both  an  upper  and  a  lower  bound; 

(b)  an  upper  bound  but  no  lower  bound; 

(c)  a  lower  bound  but  no  upper  bound; 

(d)  neither  an  upper  nor  a  lower  bound; 

(e)  a  least  upper  bound  which  is  in  the  collection  M; 

(f)  a  least  upper  bound  which  is  not  in  the  collection  M. 

7.2.4  Prove  that  if  a  collection  M  of  real  numbers  has  a  lower  bound,  then 
it  has  a  greatest  lower  bound. 

7.2.5 Let $M$ be a collection of real numbers, and let $t$ be any fixed positive
real number. Let $N$ be the collection of numbers of the form $tx$, where $x$
is in $M$. Show that if $b$ is the least upper bound of $M$, then $tb$ is the least
upper bound of $N$.

7.2.6 Show that if $a$ is a positive real number there is a positive integer $n$
such that $1/n < a$.

7.2.7 Let $M$ and $N$ be two collections of real numbers having $a$ and $b$,
respectively, as least upper bounds. Let $P$ be the collection of all numbers
of the form $x + y$, where $x$ is in $M$ and $y$ is in $N$. Show that $a + b$ is the
least upper bound of $P$.

7.2.8  Let  M  be  the  collection  of  real  numbers 

$x_n = (-1)^n [2 - 4/2^n]$, $n = 1, 2, \ldots$.

Find the least upper bound and greatest lower bound of $M$. Do the same
for the collection of real numbers

$y_n = (-1)^n + 1/n$, $n = 1, 2, \ldots$.

7.2.9  Does  the  ordered  system  of  all  nonzero  real  numbers  satisfy  the 
postulate  of  continuity? 

7.2.10  State  the  Archimedean  law  and  Dedekind’s  theorem  of  continuity 
in  the  terminology  and  symbolism  of  a  complete  ordered  field. 

7.2.11 Show that for any real number $r$ there is a smallest integer $n$ such
that $n > r$.

7.2.12  (a)  Show  that  between  any  two  real  numbers  c  and  d,  c  >  d,  there 
lies  at  least  one  other  real  number. 

(b) Show that between any two real numbers $c$ and $d$, $c > d$, there lies
a rational number $m/n$.

7.3.1 Deduce the following easy theorems about any natural numbers
$a$, $b$, $c$, $d$:

(a) If $a > b$ and $b > c$, then $a > c$.

(b)  If  d  b ,  then  a  c  b  c . 

(c)  If  c  +  a  <  c  +  b,  then  a  <  b.  (This  is  the  converse  of  Theorem  7 
of  Section  7.3.) 

(d)  If  ca  <  cb,  then  a  <  b.  (This  is  the  converse  of  Theorem  9  of 
Section  7.3.) 

(e) If $a < b$ and $c < d$, then $ac < bd$.

(f) If $a > b$, then $a \geq b + 1$.

7.3.2  If  n  is  an  arbitrary  natural  number,  prove,  by  means  of  the  principle 
of  mathematical  induction,  that 

(a) $1^2 + 2^2 + \cdots + n^2 = n(n + 1)(2n + 1)/6$.

(b) $1^3 + 2^3 + \cdots + n^3 = [n(n + 1)/2]^2$.

(c) $(1)(2) + (2)(3) + \cdots + n(n + 1) = n(n + 1)(n + 2)/3$.

(d) $1/(1)(2) + 1/(2)(3) + \cdots + 1/n(n + 1) = n/(n + 1)$.

(e) $8^n - 3^n$ is divisible by 5.
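
Before attempting the induction proofs, it can be reassuring to confirm that the five statements of Problem 7.3.2 at least hold for small values of $n$. The Python sketch below (an illustration only; the function name `check_identities` and the range 1 to 50 are arbitrary choices, not from the text) does this with exact integer and rational arithmetic. A finite check of this kind is not a proof; compare Problem 7.3.8.

```python
from fractions import Fraction

def check_identities(n):
    """Return True if statements (a)-(e) of Problem 7.3.2 hold for this n."""
    ks = range(1, n + 1)
    a = sum(k**2 for k in ks) == n * (n + 1) * (2 * n + 1) // 6
    b = sum(k**3 for k in ks) == (n * (n + 1) // 2) ** 2
    c = sum(k * (k + 1) for k in ks) == n * (n + 1) * (n + 2) // 3
    # Fractions keep the sum in (d) exact instead of using floating point.
    d = sum(Fraction(1, k * (k + 1)) for k in ks) == Fraction(n, n + 1)
    e = (8**n - 3**n) % 5 == 0
    return a and b and c and d and e

all_hold = all(check_identities(n) for n in range(1, 51))
print(all_hold)
```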

7.3.3 (harder situations) If $n$ is an arbitrary natural number, prove, by means
of the principle of mathematical induction, that

(a) $1^5 + 2^5 + \cdots + n^5 = n^2(n + 1)^2(2n^2 + 2n - 1)/12$.

(b) $1^2 + 4^2 + 7^2 + \cdots + (3n - 2)^2 = n(6n^2 - 3n - 1)/2$.

(c) $(1 + a)^n \geq 1 + na$ if $a > -1$.

(d) $n^3 + 1 > n^2 + n$ for $n \geq 2$.

(e) $3^{2n+2} - 8n - 9$ is divisible by 64.

(f) $9^n - 8n - 1$ is divisible by 64.

(g) $7^{2n} - 48n - 1$ is divisible by 2304.

(h) $a^n - b^n$ is divisible by $a - b$.

7.3.4  Establish  the  following  geometrical  theorems  by  mathematical 
induction: 

(a) The sum of the interior angles of a convex $n$-gon is $(n - 2)180°$.

(b) The maximum number of lines determined by $n \geq 2$ points in a
plane is $n(n - 1)/2$.

(c) The maximum number of points of intersection of $n \geq 2$ lines in
a plane is $n(n - 1)/2$.

(d) The number of regions into which $n$ straight lines divide a plane
can never exceed $2^n$.

7.3.5 If an amount of money $P$ is invested at an annual interest rate $r$
and interest is compounded annually, show that after $n$ years the sum
amounts to $P(1 + r)^n$.

7.3.6  By  means  of  the  second  principle  of  mathematical  induction  prove 
that 

(a)  for  every  natural  number  n,  n  +  1  is  a  prime  or  can  be  factored 
into  primes; 

(b) the number of positive prime factors of $n \geq 2$ is less than $2 \log_e n$;

(c)  in  every  set  of  n  distinct  natural  numbers  there  is  a  greatest. 

7.3.7 Show that for every natural number $n$

$u_n = \frac{(1 + \sqrt{5})^n - (1 - \sqrt{5})^n}{2^n \sqrt{5}}$

is a natural number.
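
The claim of Problem 7.3.7 can be explored numerically without any floating-point error by writing $(1 + \sqrt{5})^n = p + q\sqrt{5}$ with integers $p$ and $q$, so that $(1 - \sqrt{5})^n = p - q\sqrt{5}$ and $u_n = 2q/2^n$. The Python sketch below is an illustration under that representation (the function name `u` and the pair `p`, `q` are introduced here, not in the text); it is a check for the first few values of $n$, not a substitute for the requested proof.

```python
def u(n):
    """Compute u_n = ((1+sqrt5)^n - (1-sqrt5)^n) / (2^n * sqrt5) exactly."""
    p, q = 1, 0                      # represents p + q*sqrt(5) = 1
    for _ in range(n):
        # (p + q*sqrt5)(1 + sqrt5) = (p + 5q) + (p + q)*sqrt5
        p, q = p + 5 * q, p + q
    # (1+sqrt5)^n - (1-sqrt5)^n = 2*q*sqrt5, so u_n = 2*q / 2^n.
    numerator, denominator = 2 * q, 2 ** n
    if numerator % denominator != 0:
        raise ValueError("u_n is not an integer for n = %d" % n)
    return numerator // denominator

first_terms = [u(n) for n in range(1, 11)]
print(first_terms)
```

For these values of $n$ the division is exact, and the resulting integers satisfy $u_{n+2} = u_{n+1} + u_n$, which suggests one possible route for an induction argument.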

7.3.8 (a) Show that $n^2 - n + 41$ is a prime number for $n = 1, 2, \ldots, 40$.
Is this sufficient to prove that $n^2 - n + 41$ is a prime number for all natural
numbers $n$? Is $n^2 - n + 41$ a prime number for all natural numbers $n$?

(b) Show that if $1 + 2 + \cdots + n = (n + 1/2)^2/2$ is true for $n =
k \geq 1$, then it is true for $n = k + 1$. Is this sufficient to prove that the
equality holds for all natural numbers $n$? Does the equality hold for all
natural numbers $n$?
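
The computational half of part (a) can be carried out mechanically. The following Python sketch (illustrative only; `is_prime` is a simple trial-division helper written for this example) tests $n^2 - n + 41$ for $n = 1, \ldots, 40$ and then looks at $n = 41$, where the expression equals $41^2$.

```python
def is_prime(m):
    """Trial-division primality test for a positive integer m."""
    if m < 2:
        return False
    d = 2
    while d * d <= m:
        if m % d == 0:
            return False
        d += 1
    return True

values = {n: n * n - n + 41 for n in range(1, 42)}
prime_up_to_40 = all(is_prime(values[n]) for n in range(1, 41))
prime_at_41 = is_prime(values[41])   # values[41] == 41 * 41
print(prime_up_to_40, prime_at_41)
```

Forty successful cases followed by a failure at the forty-first show why verification of instances, however many, cannot replace the two steps of an induction proof.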

7.3.9  Find  the  fallacy  in  the  following  proof  by  mathematical  induction: 
P(n) :  All  numbers  in  a  set  of  n  numbers  are  equal  to  one  another. 

I.  P(l)  is  obviously  true. 

II. Suppose $k$ is a natural number for which $P(k)$ is true. Let $a_1, a_2, \ldots,
a_k, a_{k+1}$ be any set of $k + 1$ numbers. Then, by the supposition, $a_1 = a_2 =
\cdots = a_k$ and $a_2 = \cdots = a_k = a_{k+1}$. Therefore $a_1 = a_2 = \cdots = a_k = a_{k+1}$,
and $P(k + 1)$ is true.

It follows that $P(n)$ is true for all natural numbers $n$.

7.3.10  Find  the  fallacy  in  the  following  proof  by  mathematical  induction: 
$P(n)$: If $a$ and $b$ are any two natural numbers such that $\max(a,b) = n$, then
$a = b$. [Note: By $\max(a,b)$, when $a \neq b$, is meant the larger of the two
numbers $a$ and $b$. By $\max(a,a)$ is meant the number $a$. Thus $\max(5,7) = 7$,
$\max(8,2) = 8$, $\max(4,4) = 4$.]

I. $P(1)$ is obviously true.

II. Suppose $k$ is a natural number for which $P(k)$ is true. Let $a$ and $b$
be any two natural numbers such that $\max(a,b) = k + 1$, and consider
$\alpha = a - 1$, $\beta = b - 1$. Then $\max(\alpha,\beta) = k$, whence, by the supposition,
$\alpha = \beta$. Therefore $a = b$ and $P(k + 1)$ is true.

It  follows  that  P(n)  is  true  for  all  natural  numbers  n. 

7.3.11  Comment  on  the  following  amusing  “proof”  that  every  natural  number 
is  interesting : 

Suppose  it  is  not  true  that  every  natural  number  is  interesting,  and  let 
M  be  the  set  of  those  natural  numbers  which  are  not  interesting.  Then  M 
is  nonempty.  Therefore,  by  the  principle  of  the  least  natural  number,  M 
contains  a  smallest  natural  number  m.  That  is,  m  is  the  smallest  natural 
number  which  is  not  interesting.  But  this  is  interesting! 

7.3.12  Deduce  Postulate  N10  (the  postulate  of  finite  induction)  from 
Postulates  N1  through  N9  together  with  the  principle  of  the  least  natural 
number.  (This  will  show  that  the  postulate  of  finite  induction  and  the 
principle  of  the  least  natural  number  are  equivalent,  and  that  either  one 
can  be  used  in  place  of  Postulate  N10  of  the  postulate  set  for  the  natural 
number  system  as  given  in  Section  7.3.) 

7.3.13  Show  that  Peano’s  postulate  set  for  the  natural  number  system  is 
implied  by  the  postulate  set  for  the  natural  number  system  given  in  Section 
7.3. 

7.3.14  (a)  Establish,  as  far  as  you  can,  type-I  sequential  independence  of 
Postulates  N1  through  N10  (see  Problem  6.5.5.) 

(b)  Establish,  as  far  as  you  can,  type-II  sequential  independence  of  the 
Peano  postulates  for  the  natural  number  system  as  listed  in  Section  7.3 
(see  Problem  6.5.6). 

7.4.1 Give complete proofs of the following theorems of Section 7.4:

(a) Theorem 21.

(b) Theorem 22.

(c) Theorem 23.

(d) Theorem 24.

(e) Theorem 25.

(f) Theorem 27.

(g) Theorem 29.

(h) Theorem 30.


7.4.2  Prove  the  following  theorems  about  integers: 

(a) If $(a,b) + (c,d) = (a,b) + (e,f)$, then $(c,d) = (e,f)$.

(b)  If (x,y)(a,b)  =  (x,y)(c,d)  and  (x,y)  then  ( a,b )  =  (c,d). 

(c)  The  equation  (a, b)  +  (x,y)  =  (c,d)  has  a  unique  solution. 

7.4.3 Prove the following theorems about integers $x$, $y$, $z$:

(a) $-(-y) = y$.

(b) $(-x)(-y) = xy$.

(c) $(-x)y = x(-y) = -(xy)$.

Agreement.  We  agree  to  write  x  +  (— y)  as  x  —  y. 

(d)  —  (x  +  y)  =  —  x  —  y. 

(e)  x(y  —  z)  =  xy  —  xz. 

(f)  If  xy  =  0,  then  x  =  0  or  y  =  0. 

7.4.4  We  may  define  inequalities  between  integers  in  terms  of  inequalities 
between natural numbers as follows. We say $(a,b) < (c,d)$, or $(c,d) > (a,b)$,
if and only if $c + b > d + a$. On the basis of this definition, prove the
following properties of the inequality signs for integers $x$, $y$, $z$:

(a) If $x < y$ and $y < z$, then $x < z$.

(b) If $x < y$, then $x + z < y + z$.

(c) If $x < y$ and $z > 0$, then $xz < yz$.

(d) If $x < y$ and $z < 0$, then $xz > yz$.

7.4.5  Give  proofs  of  the  following  theorems  of  Section  7.4: 

(a)  Theorem  32. 

(b)  Theorem  33. 

(c)  Theorem  34. 

7.4.6  Prove  the  following  theorems  about  rational  numbers: 

(a) $(a,b) + (0,1) = (a,b)$.

(b) $(a,b) + (-a,b) = (0,1)$.

(c) $(a,b)(m,m) = (a,b)$.

(d) If $(a,b) + (c,d) = (a,b) + (e,f)$, then $(c,d) = (e,f)$.

(e) If $(a,b)(c,d) = (a,b)(e,f)$, and $a \neq 0$, then $(c,d) = (e,f)$.

(f) The equation $(a,b)(x,y) = (c,d)$ has a unique solution if $a \neq 0$.

7.4.7 Let $a^+$ denote a positive integer (or zero) and let $a^-$ denote the
corresponding negative integer (or zero). Show that

(a) $(a^-,b^-) = (a^+,b^+)$.

(b) $(a^+,b^-) = (a^-,b^+)$.

Agreement. Of the two equal forms $(a^-,b^-)$, $(a^+,b^+)$ we agree to use
only the second, and of the two equal forms $(a^+,b^-)$, $(a^-,b^+)$ we agree to
use only the second.

Definition. We now say $(a,b) > (c,d)$ if and only if $ad > bc$.

Show  that 

(c)  if  (a,b)  >  (c,d)  and  (c,d)  >  (e,f),  then  (a,b)  >  (e,f). 

(d) if $(a,b) > (c,d)$, then $(a,b) + (e,f) > (c,d) + (e,f)$.

(e) if $(a,b) > (c,d)$ and $(e,f) > (g,h)$, then $(a,b) + (e,f) > (c,d) + (g,h)$.

(f) if $(a,b) > (c,d)$ and $(e,f) > (0,b)$, then $(a,b)(e,f) > (c,d)(e,f)$.

(g) if $(a,b) > (c,d)$ and $(0,b) > (e,f)$, then $(c,d)(e,f) > (a,b)(e,f)$.

7.4.8  (a)  Prove  Theorem  35  of  Section  7.4. 

(b)  Show  that  the  order  relation,  > ,  for  rational  numbers  (see  Problem 
7.4.7)  is  preserved  under  the  isomorphism  of  Theorem  35. 


7.5.1  Supply  proofs  for  the  following  theorems  of  Section  7.5: 

(a)  Theorem  36. 

(b)  Theorem  37. 

(c)  Theorem  38. 

(d)  Theorem  39. 

7.5.2 Let $(A_1|B_1)$, $(A_2|B_2)$ be any two members of $R$. Let $A$ be the set of
all rational numbers of the form $xy$, where $x$ is in $A_1$ and $y$ is in $A_2$, and
let $B$ be the set of all remaining rational numbers. Show why it would be
inadequate to define the product of $(A_1|B_1)$ and $(A_2|B_2)$ to be $(A|B)$.

7.5.3 Show that for any member $(A|B)$ of $R$

(a) $(A|B) + (M|P) = (A|B)$,

(b) $(A|B)(M|P) = (M|P)$.


7.5.4 Definition. We say $(A|B) > (C|D)$ if and only if the set of rational
numbers $C$ is a proper subset of the set of rational numbers $A$. Show that

(a) if $(A|B) > (C|D)$ and $(C|D) > (E|F)$, then $(A|B) > (E|F)$.

(b) if $(A|B) > (C|D)$, then $(A|B) + (E|F) > (C|D) + (E|F)$.

(c) if $(A|B) > (C|D)$ and $(E|F) > (G|H)$, then $(A|B) + (E|F) >
(C|D) + (G|H)$.

(d) if $(A|B) > (C|D)$ and $(E|F) > (M|P)$, then $(A|B)(E|F) > (C|D)(E|F)$.

(e) if $(A|B) > (C|D)$ and $(M|P) > (E|F)$, then $(C|D)(E|F) > (A|B)(E|F)$.

7.5.5 Supply proofs for the following theorems of Section 7.5:

(a) Theorem 43.

(b) Theorem 44.

(c) Theorem 45.

(d) Theorem 46.

(e) Theorem 47.

(f) Theorem 48.

(g) Theorem 49.


7.5.6 Definition. We call the complex number $[a, -b]$ the complex
conjugate of the complex number $[a,b]$. Show that

(a) $[a, -b][a, b] = [a^2 + b^2, 0]$,

(b) the equation $[c,d][x,y] = [a,b]$ always has a solution if $c \neq 0$ or
$d \neq 0$.

7.5.7  Show  that 

(a) $[r_1 \cos \theta_1, r_1 \sin \theta_1][r_2 \cos \theta_2, r_2 \sin \theta_2] = [r \cos \theta, r \sin \theta]$, where
$r = r_1 r_2$ and $\theta = \theta_1 + \theta_2$.

(b) $[r \cos \theta, r \sin \theta]^n = [r^n \cos n\theta, r^n \sin n\theta]$, where $n$ is a positive integer.
(This is known as De Moivre’s theorem.)


CHAPTER EIGHT

SETS 

8.1  Sets  and  Their  Basic  Relations 
and  Operations 

The  most  important  and  most  basic  term  to  be  found  in  modern 
mathematics  and  logic  is  that  of  set ,  or  class.  Although  some  modern  studies 
make  a  technical  distinction  between  set  and  class,  we  do  not  do  so  in  this 
treatment;  it  is  recognized,  however,  that  mathematicians  are  inclined  to 
use  the  word  set ,  whereas  logicians  frequently  refer  to  a  class. 

The  modern  mathematical  theory  of  sets  is  one  of  the  most  remarkable 
creations  of  the  human  mind.  Because  of  the  unusual  boldness  of  some  of  the 
ideas  found  in  its  study,  and  because  of  some  of  the  singular  methods  of 
proof  to  which  it  has  given  rise,  the  theory  of  sets  is  indescribably  fascinating. 
But  above  this,  the  theory  has  assumed  tremendous  importance  for  almost 
the  whole  of  mathematics.  It  has  enormously  enriched,  clarified,  extended, 
and  generalized  many  domains  of  mathematics,  and  its  influence  on  the 
study  of  the  foundations  of  mathematics  has  been  profound.  The  theory  of 
sets,  we  shall  see,  forms  one  of  the  connecting  links  between  mathematics 
on  the  one  hand  and  philosophy  and  logic  on  the  other. 




The  reader  can  hardly  have  failed  to  notice  the  increasing  use  made  in 
the past chapters of the word “set.” In each instance the set under
consideration was some collection or aggregate of objects. In practice it is common
to  describe  a  set  by  itemizing  the  elements,  or  members,  that  comprise  it 
or  by  specifying  a  property  that  each  element  of  the  set  possesses  and  that 
objects  which  are  not  elements  of  the  set  do  not  possess. 

It  now  is  desirable  that  we  discuss  sets  in  general,  and  that  we  examine 
some  of  the  relations  that  may  exist  between  sets  and  some  of  the  operations 
that  may  be  performed  on  sets.  We  do  this  in  an  intuitive  manner  in  the 
present  section.  In  the  following  sections  we  shall  consider  the  advisability 
of  a  postulational  approach  to  the  study  of  sets,  and  we  shall  touch  upon 
some  of  the  matters  mentioned  above. 

We shall, then, think of a set as simply a collection of definite distinct
objects of our perception or thought. The objects which make up a set will
be called elements of the set, and, for the time being, we shall put no restriction
on the nature of these objects.* We may, for example, consider the set
consisting of the men, Brown, Smith, and Jones, or the set consisting of a
pencil, a chair, and a tree, or the set consisting of the integers 1, 2, and 3.
Each of these sets contains three elements. Sets may, however, be of any size
insofar as the number of elements is concerned. Thus we may speak of the
set of all stars of magnitude not greater than six, or of the set of all natural
numbers; the former set, though large, contains a finite number of elements,
while the latter set contains infinitely many elements.† Another example
of a set containing infinitely many elements is the set of all points on a given
line segment. We even frequently find it convenient to consider a set
containing no elements, such as the set of all kings of the United States, or the
set of all even prime numbers greater than 10; such a set will be referred to
as a null, or empty, set. If $a$ is an element of set $A$, we write $a \in A$ (read,
“$a$ is an element of $A$”), and if $a$ is not an element of set $A$, we write $a \notin A$
(read, “$a$ is not an element of $A$”). We now introduce some definitions and
some further notation. The first two definitions concern two important
dyadic relations that may exist between pairs of sets.

Definition 1. Two sets $A$ and $B$ are said to be equal, and we write
$A = B$, if and only if every element of $A$ is an element of $B$ and every element
of $B$ is an element of $A$.

It is important to note that the order in which we name the elements of
a set is immaterial. Thus the sets‡ {Brown, Smith, Jones} and {Smith, Jones,
Brown} are equal, whereas the sets {Brown, Smith, Jones} and {Black,
Smith, Jones} are not equal. Equality of sets is an identical equality, although
the identity of the elements in the two sets may at times be disguised, as in
the two equal sets $A$ = {the smallest prime number, the first prime number
after 15, the fourth prime number} and $B$ = {the positive root of $x^2 - 5x
- 14 = 0$, $2^4 + 1$, the first even positive integer}. We also see, by Definition
1, that all empty sets are equal. It is customary to denote this unique empty
set by $\emptyset$.

*See, however, Section 9.4.

†Explicit definitions of finite and infinite sets will be given in Section 8.3.

‡Sets are frequently designated by exhibiting their elements within braces.

Definition 2. A set $A$ is said to be a subset of a set $B$ if and only if
every element of set $A$ is an element of set $B$; we write $A \subset B$ (read, “$A$
is included in $B$”).

For example, the set {Brown, Smith, Jones} is included in the set
{Black, Brown, Smith, Jones}. It is to be noted that every set is included in
itself. The situation where $A \subset B$ and there is at least one element of $B$
which is not an element of $A$ is indicated by saying that $A$ is a proper subset
of $B$.* In the example above, the first set is a proper subset of the second one.
By convention, the null set $\emptyset$ is regarded as a subset of every set, and is a
proper subset of every set except itself. It is apparent that the inclusion
relation is a transitive relation; that is, if $A \subset B$ and $B \subset C$, then $A \subset C$.

The  next  two  definitions  concern  two  binary  operations  on  sets. 

Definition 3. The union of two sets $A$ and $B$, written $A \cup B$, and
sometimes read “$A$ cup $B$,” is the set of all elements which belong to $A$ or
to $B$ or to both $A$ and $B$.

Definition 4. The intersection of two sets $A$ and $B$, written $A \cap B$,
and sometimes read “$A$ cap $B$,” is the set of all elements which belong to
both $A$ and $B$.

Thus, if $A$ = {Brown, Smith, Jones} and $B$ = {Black, Smith, Brown},
then $A \cup B$ = {Black, Brown, Smith, Jones}, and $A \cap B$ = {Brown,
Smith}. It can be shown that each of these binary operations is commutative
and associative, and each is distributive with respect to the other.
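
The two operations just defined correspond directly to the `|` and `&` operators on Python's built-in `set` type, so the example can be reproduced in a few lines. In the sketch below the sets `A` and `B` are those of the text, while the third set `C` is a hypothetical choice made only so that the associative and distributive properties can be tried on one instance; such a trial illustrates the properties but does not prove them.

```python
A = {"Brown", "Smith", "Jones"}
B = {"Black", "Smith", "Brown"}
C = {"Jones", "Black"}            # a third set, chosen only for illustration

union = A | B                     # elements in A or in B or in both
intersection = A & B              # elements in both A and B

commutative = ((A | B) == (B | A)) and ((A & B) == (B & A))
associative = (((A | B) | C) == (A | (B | C))) and (((A & B) & C) == (A & (B & C)))
distributive = ((A & (B | C)) == ((A & B) | (A & C))) and \
               ((A | (B & C)) == ((A | B) & (A | C)))

print(sorted(union), sorted(intersection))
print(commutative, associative, distributive)
```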

In many discussions it is convenient to regard all the sets with which
one may be concerned as subsets of an over-all embracing set $I$; $I$ is then
known as the universe. In such situations the following unary operation on
sets is useful.

Definition 5. The complement of a set $A$ relative to a universe $I$ is the
set of all elements of $I$ which are not elements of $A$. When the universe $I$ is
clearly understood, we denote the complement of $A$ by $A'$.†

For example, suppose the universe $I$ = {Black, Brown, Smith, Jones}
and $A$ = {Brown, Smith}. Then $A'$ = {Black, Jones}. It is evident that in
general the complement of a set depends strongly on the universe $I$; on
the other hand, no matter what set is chosen for $I$, it is easily seen that
$I' = \emptyset$ and $\emptyset' = I$.

*Some authors use the notation $A \subseteq B$ to indicate that $A$ is a subset of $B$, and reserve
$A \subset B$ to mean that $A$ is a proper subset of $B$.

†Other notations for the complement include $\sim A$, $-A$, $I - A$, $C(A)$.

Arbitrary sets may be conveniently represented graphically by so-called
Venn diagrams,* which are much like the Euler diagrams described in
Section 1.2. In this representation, sets are pictured as subregions of some
fixed region chosen to represent the universe $I$. The sets $A \cup B$, $A \cap B$,
and $A'$ are represented in Venn diagrams by the shaded regions in Figures
32, 33, and 34, respectively. Here $I$ is the indicated square that includes $A$
and $B$ as subsets.


[FIG. 32, FIG. 33, and FIG. 34: Venn diagrams with the regions $A \cup B$, $A \cap B$, and $A'$ shaded, respectively.]

There  are  many  relationships  among  sets  which  involve  the  operations 
of  union,  intersection,  and  complementation.  Such  relationships  often  may  be 
neatly  tested  by  means  of  Venn  diagrams.  Consider,  for  example,  the  validity 
of the relationship $(A \cup B)' = A' \cap B'$.


FIG.  35 


In the diagram of Figure 35, the horizontally shaded region represents $A'$,
and the vertically shaded region represents $B'$. It follows that the
cross-hatched region then represents $A' \cap B'$, the intersection of $A'$ and $B'$.
But this cross-hatched region also represents the complement of the union
of $A$ and $B$. We have thus verified the relationship.

*John Venn employed the device in 1876 in a paper entitled “Boole’s logical system.”

A test of a relationship among sets by Venn diagrams is not, of course, an acceptable proof of the relationship concerned, for, though the test is convincing to our common sense, it is merely an inductive procedure and is therefore technically impermissible as proof.

A  proof  of  a  relationship  among  sets  may  be  accomplished  by  considering 
separately  each  possible  case  of  an  element  e  of  the  universe  I.  Thus,  for 
the  relationship  considered  above,  there  are  four  cases,  for  an  element  e  of  I 
may  (1)  lie  in  both  A  and  B,  (2)  lie  in  A  but  not  in  B,  (3)  lie  in  B  but  not 
in  A,  (4)  lie  in  neither  A  nor  B.  Let  us  consider  the  four  cases  in  turn. 

1. e ∈ A and e ∈ B. Then e ∉ A′ and e ∉ B′, whence e ∉ A′ ∩ B′. Also, e ∈ A ∪ B, whence e ∉ (A ∪ B)′.

2. e ∈ A but e ∉ B. Then e ∉ A′ and hence e ∉ A′ ∩ B′. Also, e ∈ A ∪ B, whence e ∉ (A ∪ B)′.

3. e ∈ B but e ∉ A. Then e ∉ B′ and hence e ∉ A′ ∩ B′. Also, e ∈ A ∪ B, whence e ∉ (A ∪ B)′.

4. e ∉ A and e ∉ B. Then e ∈ A′ and e ∈ B′, whence e ∈ A′ ∩ B′. Also, e ∉ A ∪ B, whence e ∈ (A ∪ B)′.

It now follows that if e is an element of (A ∪ B)′, then e is an element of A′ ∩ B′, and if e is an element of A′ ∩ B′, then e is an element of (A ∪ B)′, whence (A ∪ B)′ = A′ ∩ B′. We shall see in the next chapter that this method of considering cases is essentially “the truth table” method of logic.
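The four cases considered above can be checked mechanically. The following is a minimal sketch in Python, not part of the original text; the function name and the use of the truth values True and False to stand for "e lies in the set" are assumptions made for illustration.

```python
from itertools import product


def de_morgan_holds_in_every_case() -> bool:
    """Check (A ∪ B)′ = A′ ∩ B′ case by case for an element e of I."""
    # Each pair (in_a, in_b) is one of the four cases: e in both sets,
    # in A only, in B only, or in neither.
    for in_a, in_b in product([True, False], repeat=2):
        in_complement_of_union = not (in_a or in_b)          # e ∈ (A ∪ B)′
        in_intersection_of_complements = (not in_a) and (not in_b)  # e ∈ A′ ∩ B′
        if in_complement_of_union != in_intersection_of_complements:
            return False
    return True
```

Calling de_morgan_holds_in_every_case() runs through exactly the cases 1 to 4 listed above and reports whether membership in the two sides agrees in each of them.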

Another method of proving relationships among sets is to employ the definitions of union, intersection, and complementation. Thus, considering the same relationship as above, we note that “e is in (A ∪ B)′” means “e is not in A or B,” or, “e is in neither A nor B,” and “e is in A′ ∩ B′” means “e is in A′ and in B′,” or, “e is in neither A nor B.” This approach assumes, as is customary in mathematical reasoning, the basic properties of the words “and,” “or,” and “not.”


There  is  still  another  method  of  establishing  relationships  among  sets, 
namely,  the  postulational  method.  In  this  method,  which  mathematicians 
generally  prefer,  we  assume  certain  relationships  among  sets  as  basic  and 
then  deduce  all  other  relationships  from  these  basic  ones.  This  approach, 
when applied to a certain portion of set theory, gives rise to a remarkable branch of abstract mathematics known as Boolean algebra, to which we now turn.


8.2  Boolean  Algebra 

Boolean algebra is a postulational abstraction of the theory of the subsets of a given set discussed in the last section. Since the algebra is developed by the axiomatic method, it is conceivable that there may be interpretations other than that which originated the subject; this is indeed the case. There are few branches of mathematics which have received more diverse postulational treatments than has Boolean algebra, so that today a student has a choice of a wide variety of approaches to the study. The postulate set that we shall employ is a well-known set given by E. V. Huntington in 1904.*

Postulate set for Boolean algebra

A Boolean algebra† is a set B of elements a, b, c, ···, with two binary operations ∪ and ∩, called cup and cap, satisfying the following postulates.

B1: Each of the operations ∪ and ∩ is commutative; that is,

    a ∪ b = b ∪ a and a ∩ b = b ∩ a

for all elements a, b of B.

B2: There exist distinct identity elements, z and u, relative to the operations ∪ and ∩, respectively; that is,

    a ∪ z = a,  a ∩ u = a,  z ≠ u

for each element a of B.

B3: Each operation is distributive relative to the other; that is,

    a ∪ (b ∩ c) = (a ∪ b) ∩ (a ∪ c),
    a ∩ (b ∪ c) = (a ∩ b) ∪ (a ∩ c).

B4: For each element a of B there exists an element a′ of B such that

    a ∪ a′ = u and a ∩ a′ = z.

It is easily verified that the subsets of a given set (as, for example, the subsets of a set of five chairs), under the binary operations of union and intersection and the unary operation of complementation, satisfy the postulates of a Boolean algebra. In this interpretation, the various subsets of the given set are identified with the elements a, b, c, ··· of the Boolean algebra, the empty set ∅ is identified with z, and the universe I (the given set) is identified with u. Since the postulate set possesses a concrete interpretation, it follows that the postulate set is absolutely consistent.
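The verification just mentioned can be carried out by brute force for a small universe. The sketch below is an illustration only; the helper names power_set and satisfies_huntington_postulates are hypothetical, and Python's frozenset operators |, & and - are used to stand for union, intersection, and complementation relative to the universe.

```python
from itertools import combinations


def power_set(universe):
    """Return all subsets of the universe as frozensets."""
    items = list(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def satisfies_huntington_postulates(universe) -> bool:
    """Check B1 through B4 for the subsets of a finite universe."""
    u = frozenset(universe)   # plays the role of the identity u
    z = frozenset()           # plays the role of the identity z
    elements = power_set(u)
    for a in elements:
        # B2: z and u are identities for cup and cap.
        if (a | z) != a or (a & u) != a:
            return False
        # B4: the set complement serves as a′.
        a_prime = u - a
        if (a | a_prime) != u or (a & a_prime) != z:
            return False
        for b in elements:
            # B1: both operations are commutative.
            if (a | b) != (b | a) or (a & b) != (b & a):
                return False
            for c in elements:
                # B3: each operation distributes over the other.
                if (a | (b & c)) != ((a | b) & (a | c)):
                    return False
                if (a & (b | c)) != ((a & b) | (a & c)):
                    return False
    # B2 also requires the two identities to be distinct.
    return z != u
```

For a universe of three or five elements the function returns True. For the empty universe it returns False, since then z and u coincide and the requirement z ≠ u of B2 fails.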

One of the first things one notices about the above postulate set is that there is in it a perfect symmetry between the two operations of cup and cap. That is, if the operations ∪ and ∩, and the identity elements z and u, should be interchanged throughout the postulate set, we would obtain the postulate set all over again. There exists in Boolean algebra, then, a principle of duality, much like the principle of duality that we observed in plane projective geometry in Section 4.5. We state this principle of duality as the first theorem in our partial development of Boolean algebra.

*E. V. Huntington [2] (Bibliography).

†The term “Boolean algebra” has come to be used in the literature in two senses, namely, (1) as any interpretation of a certain postulate set, and (2) as the resulting system of symbolic algebra following from this postulate set. The double usage causes no confusion, and the particular use intended is always clear from the context.

Theorem 1 (principle of duality). Any theorem of Boolean algebra remains valid if the operations ∪ and ∩, and the identity elements z and u, are interchanged throughout the statement of the theorem.

This  theorem  is  a  theorem  about  theorems  and  accordingly  comes 
under  the  subject  of  metamathematics.  It  permits  us  to  state  the  theorems  of 
Boolean  algebra  in  dual  pairs  and  guarantees  that  the  proof  of  one  statement 
of  a  dual  pair  is  sufficient  for  the  establishment  of  both  statements.  We  shall 
now  deduce  a  number  of  theorems  of  Boolean  algebra,  observing  the  fact 
that  a  separate  demonstration  is  not  necessary  for  the  dual  of  a  theorem  that 
is  proved. 
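The interchange described in the principle of duality is purely mechanical, which the following sketch tries to make concrete. It is an illustration under stated assumptions: statements are written as plain strings in which the letters z and u occur only as the identity elements, and the names DUAL, dual, and POSTULATES are hypothetical.

```python
# Translation table that swaps cup with cap and z with u.
DUAL = str.maketrans({"∪": "∩", "∩": "∪", "z": "u", "u": "z"})


def dual(statement: str) -> str:
    """Return the dual of a Boolean-algebra statement given as a string."""
    return statement.translate(DUAL)


# The equations of B1 through B4 (the condition z ≠ u is omitted here;
# it is symmetric in z and u and so is its own dual in meaning).
POSTULATES = [
    "a ∪ b = b ∪ a",
    "a ∩ b = b ∩ a",
    "a ∪ z = a",
    "a ∩ u = a",
    "a ∪ (b ∩ c) = (a ∪ b) ∩ (a ∪ c)",
    "a ∩ (b ∪ c) = (a ∩ b) ∪ (a ∩ c)",
    "a ∪ a′ = u",
    "a ∩ a′ = z",
]
```

Applying dual to every entry of POSTULATES yields the same collection of equations again, which is the symmetry on which the principle rests. Applying dual twice returns the original statement.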


Theorem 2 (the idempotent laws). a ∪ a = a; a ∩ a = a.

    a = a ∪ z                      (by B2)
      = a ∪ (a ∩ a′)               (by B4)
      = (a ∪ a) ∩ (a ∪ a′)         (by B3)
      = (a ∪ a) ∩ u                (by B4)
      = a ∪ a.                     (by B2)

Theorem 3. a ∪ u = u; a ∩ z = z.

    z = a ∩ a′                     (by B4)
      = a ∩ (a′ ∪ z)               (by B2)
      = (a ∩ a′) ∪ (a ∩ z)         (by B3)
      = z ∪ (a ∩ z)                (by B4)
      = (a ∩ z) ∪ z                (by B1)
      = a ∩ z.                     (by B2)

Theorem 4 (the absorption laws). a ∩ (a ∪ b) = a; a ∪ (a ∩ b) = a.

    a = a ∪ z                      (by B2)
      = a ∪ (b ∩ z)                (by Th. 3)
      = (a ∪ b) ∩ (a ∪ z)          (by B3)
      = (a ∪ b) ∩ a                (by B2)
      = a ∩ (a ∪ b).               (by B1)

Theorem 5 (the associative laws). a ∪ (b ∪ c) = (a ∪ b) ∪ c; a ∩ (b ∩ c) = (a ∩ b) ∩ c.

Let x = a ∪ (b ∪ c) and y = (a ∪ b) ∪ c. Then

    a ∩ x = (a ∩ a) ∪ [a ∩ (b ∪ c)]             (by B3)
          = a ∪ [a ∩ (b ∪ c)]                   (by Th. 2)
          = a,                                  (by Th. 4)

and

    a ∩ y = [a ∩ (a ∪ b)] ∪ (a ∩ c)             (by B3)
          = a ∪ (a ∩ c)                         (by Th. 4)
          = a.                                  (by Th. 4)

Therefore a ∩ x = a ∩ y. Now

    a′ ∩ x = (a′ ∩ a) ∪ [a′ ∩ (b ∪ c)]          (by B3)
           = (a ∩ a′) ∪ [a′ ∩ (b ∪ c)]          (by B1)
           = z ∪ [a′ ∩ (b ∪ c)]                 (by B4)
           = [a′ ∩ (b ∪ c)] ∪ z                 (by B1)
           = a′ ∩ (b ∪ c),                      (by B2)

and

    a′ ∩ y = [a′ ∩ (a ∪ b)] ∪ (a′ ∩ c)          (by B3)
           = [(a′ ∩ a) ∪ (a′ ∩ b)] ∪ (a′ ∩ c)   (by B3)
           = [(a ∩ a′) ∪ (a′ ∩ b)] ∪ (a′ ∩ c)   (by B1)
           = [z ∪ (a′ ∩ b)] ∪ (a′ ∩ c)          (by B4)
           = [(a′ ∩ b) ∪ z] ∪ (a′ ∩ c)          (by B1)
           = (a′ ∩ b) ∪ (a′ ∩ c)                (by B2)
           = a′ ∩ (b ∪ c).                      (by B3)

Therefore a′ ∩ x = a′ ∩ y. Hence

    (a ∩ x) ∪ (a′ ∩ x) = (a ∩ y) ∪ (a′ ∩ y),    (substitution)
    (x ∩ a) ∪ (x ∩ a′) = (y ∩ a) ∪ (y ∩ a′),    (by B1)
    x ∩ (a ∪ a′) = y ∩ (a ∪ a′),                (by B3)
    x ∩ u = y ∩ u,                              (by B4)
    x = y.                                      (by B2)

Theorem 6. a′ is unique.

Let b and c be elements of B such that

    a ∪ b = u,  a ∩ b = z;  a ∪ c = u,  a ∩ c = z.

Now

    c = c ∩ u                      (by B2)
      = c ∩ (a ∪ b)                (substitution)
      = (c ∩ a) ∪ (c ∩ b)          (by B3)
      = (a ∩ c) ∪ (b ∩ c)          (by B1)
      = z ∪ (b ∩ c)                (by substitution)
      = (b ∩ c) ∪ z                (by B1)
      = b ∩ c,                     (by B2)

and, in the same way,

    b = c ∩ b
      = b ∩ c.                     (by B1)

Therefore b = c, and the theorem follows by reference to the definition of a′.


Theorem 7 (De Morgan’s laws). (a ∪ b)′ = a′ ∩ b′; (a ∩ b)′ = a′ ∪ b′.

    (a ∪ b) ∪ (a′ ∩ b′) = [(a ∪ b) ∪ a′] ∩ [(a ∪ b) ∪ b′]      (by B3)
                        = [a′ ∪ (a ∪ b)] ∩ [(a ∪ b) ∪ b′]      (by B1)
                        = [(a′ ∪ a) ∪ b] ∩ [a ∪ (b ∪ b′)]      (by Th. 5)
                        = [b ∪ (a ∪ a′)] ∩ [a ∪ (b ∪ b′)]      (by B1)
                        = (b ∪ u) ∩ (a ∪ u)                    (by B4)
                        = u ∩ u                                (by Th. 3)
                        = u,                                   (by B2)

and

    (a ∪ b) ∩ (a′ ∩ b′) = (a′ ∩ b′) ∩ (a ∪ b)                  (by B1)
                        = [(a′ ∩ b′) ∩ a] ∪ [(a′ ∩ b′) ∩ b]    (by B3)
                        = [(b′ ∩ a′) ∩ a] ∪ [(a′ ∩ b′) ∩ b]    (by B1)
                        = [b′ ∩ (a′ ∩ a)] ∪ [a′ ∩ (b′ ∩ b)]    (by Th. 5)
                        = [b′ ∩ (a ∩ a′)] ∪ [a′ ∩ (b ∩ b′)]    (by B1)
                        = (b′ ∩ z) ∪ (a′ ∩ z)                  (by B4)
                        = z ∪ z                                (by Th. 3)
                        = z.                                   (by B2)

Therefore, by Theorem 6, (a′ ∩ b′) = (a ∪ b)′.
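Theorems 2 through 7 can be spot-checked in an interpretation other than subsets. The sketch below uses an example that is not taken from the text: the positive divisors of 30, with cup read as least common multiple, cap as greatest common divisor, z = 1, u = 30, and a′ = 30/a. That these choices satisfy B1 through B4 is a standard fact for square-free integers such as 30, and is assumed here rather than derived; all names are hypothetical.

```python
from math import gcd

U = 30                                    # plays the role of u
Z = 1                                     # plays the role of z
ELEMENTS = [d for d in range(1, U + 1) if U % d == 0]


def cup(a, b):
    """Least common multiple, standing in for ∪."""
    return a * b // gcd(a, b)


def cap(a, b):
    """Greatest common divisor, standing in for ∩."""
    return gcd(a, b)


def comp(a):
    """Complementary divisor, standing in for a′."""
    return U // a


def theorems_hold() -> bool:
    for a in ELEMENTS:
        if cup(a, a) != a or cap(a, a) != a:               # Theorem 2
            return False
        if cup(a, U) != U or cap(a, Z) != Z:               # Theorem 3
            return False
        for b in ELEMENTS:
            if cap(a, cup(a, b)) != a or cup(a, cap(a, b)) != a:   # Theorem 4
                return False
            if comp(cup(a, b)) != cap(comp(a), comp(b)):   # Theorem 7
                return False
            if comp(cap(a, b)) != cup(comp(a), comp(b)):   # Theorem 7 (dual)
                return False
            for c in ELEMENTS:
                if cup(a, cup(b, c)) != cup(cup(a, b), c):  # Theorem 5
                    return False
                if cap(a, cap(b, c)) != cap(cap(a, b), c):  # Theorem 5 (dual)
                    return False
    return True
```

A check of this kind is only a test in one finite interpretation; it does not replace the deductions from the postulates given above.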

As yet nothing has been said about a relation in Boolean algebra corresponding to the relation of inclusion, that plays such a fundamental role in the theory of sets. We now introduce such a relation.

Definition 1. If a and b are elements of B, we write a ⊂ b if and only if a ∪ b = b, and we say “a is contained in b.”

Theorem 8. a ⊂ a.

For, by Theorem 2, we have a ∪ a = a. Hence, by Definition 1, a ⊂ a.

Theorem 9. If a ⊂ b and b ⊂ a, then a = b.

For
    a ∪ b = b and b ∪ a = a.       (by Def. 1)
But
    a ∪ b = b ∪ a.                 (by B1)
Hence
    a = b.                         (substitution)

Theorem 10. If a ⊂ b and b ⊂ c, then a ⊂ c.

For
    a ∪ b = b and b ∪ c = c.       (by Def. 1)
Therefore
    a ∪ c = a ∪ (b ∪ c)            (substitution)
          = (a ∪ b) ∪ c            (by Th. 5)
          = b ∪ c                  (substitution)
          = c,                     (substitution)
and
    a ⊂ c.                         (by Def. 1)

Theorem 11. z ⊂ a ⊂ u for each element a of B.

For
    z ∪ a = a ∪ z                  (by B1)
          = a.                     (by B2)
Therefore
    z ⊂ a.                         (by Def. 1)
Also
    a ∪ u = u,                     (by Th. 3)
whence
    a ⊂ u.                         (by Def. 1)

Theorem 12. If a ⊂ x and b ⊂ x, then (a ∪ b) ⊂ x.

For
    a ∪ x = x and b ∪ x = x.       (by Def. 1)
Then
    (a ∪ b) ∪ x = a ∪ (b ∪ x)      (by Th. 5)
                = a ∪ x            (substitution)
                = x,               (substitution)
whence
    (a ∪ b) ⊂ x.                   (by Def. 1)

We observe, from Theorems 8, 9, and 10, that the relation ⊂ in Boolean algebra is reflexive, antisymmetric, and transitive (see Section 5.5 and Problem 5.5.11). The elements of a set S are said to be partially ordered with respect to a relation R if R possesses these three properties on S. It follows that the elements of a Boolean algebra are partially ordered with respect to the relation ⊂. Theorem 11 gives universal bounds, relative to this relation, for all elements of B, and Theorem 12 establishes that a ∪ b is the least upper bound of a and b.
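In the interpretation by subsets, the relation of Definition 1 and the properties just listed can be checked exhaustively for a small universe. The following sketch is illustrative only; the names subsets, contained_in, and is_partial_order_with_lub are hypothetical, and Python's | on frozensets stands for cup.

```python
from itertools import combinations


def subsets(universe):
    """Return all subsets of the universe as frozensets."""
    items = sorted(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def contained_in(a, b) -> bool:
    """Definition 1: a is contained in b if and only if a ∪ b = b."""
    return (a | b) == b


def is_partial_order_with_lub(universe) -> bool:
    elems = subsets(universe)
    for a in elems:
        if not contained_in(a, a):                       # reflexive (Th. 8)
            return False
        for b in elems:
            if contained_in(a, b) and contained_in(b, a) and a != b:
                return False                             # antisymmetric (Th. 9)
            join = a | b
            # a ∪ b is an upper bound of a and of b.
            if not (contained_in(a, join) and contained_in(b, join)):
                return False
            for x in elems:
                if (contained_in(a, b) and contained_in(b, x)
                        and not contained_in(a, x)):
                    return False                         # transitive (Th. 10)
                if (contained_in(a, x) and contained_in(b, x)
                        and not contained_in(join, x)):
                    return False                         # least upper bound (Th. 12)
    return True
```

For frozensets, contained_in(a, b) agrees with Python's built-in subset test a <= b, which is the inclusion relation the definition is meant to abstract.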

Of  course,  the  above  collection  of  twelve  theorems  is  not  exhaustive; 
in  fact,  mathematicians  have  deduced  a  tremendous  number  of  theorems  of 
Boolean  algebra.  The  twelve  theorems  should  serve,  however,  to  give  some 
idea  of  the  nature  of  this  algebra,  which  has,  on  the  one  hand,  much  in 
common  with  the  familiar  algebra  of  high  school,  and  yet,  on  the  other  hand, 
possesses  many  striking  differences.  The  subject  has  a  number  of  valuable 
applications.  In  the  next  chapter,  we  shall  examine  its  application  to  certain 
portions  of  logic.  Recently  the  algebra  has  found  practical  importance  in 
the  analysis  and  design  of  electrical  networks*  and  in  the  development  of  a 
study  known  as  the  theory  of  information. 

Boolean algebra is named after the English logician and mathematician George Boole (1815-1864), who was the first to give an algebraic formulation to the theory of classes in logic. Since the algebra originated in an attempt to establish a symbolic, or mathematical, logic, we shall withhold a fuller account of the history of the algebra until the next chapter, which is devoted to the subject of logic.

*See F. E. Hohn [1]. This paper contains a bibliography of applications of Boolean algebra to electronics.

We now turn to the application of the theory of sets to numbers; this work originated with Georg Cantor (1845-1918) toward the end of the nineteenth century and is fundamental insofar as the foundations of mathematics are concerned.


8.3  Sets and the Foundations of Mathematics

In  Chapter  7  we  saw  how  the  great  bulk  of  mathematics  can  be  reached 
from  the  natural  number  system  in  a  purely  definitional  way,  and  thus  how 
the  consistency  of  mathematics  can  be  made  to  rest  upon  the  consistency 
of  this  basic  number  system.  It  is  natural  to  wonder  whether  the  starting 
point  of  the  definitional  development  of  mathematics  cannot  perhaps  be 
pushed to an even deeper level. Mathematicians who attempt such a construction usually start with the theory of sets, the concepts of which are already involved in the postulational development of the natural number system. The success of this program enables one to define natural number in terms of sets, and hence to reduce the number of undefined terms that must be assumed in mathematics. The program is far too extensive for us to carry through in detail, but we can indicate the main ideas of the development. Of course, a rigorous treatment along these lines requires that we start from a systematic postulational development of set theory. There is considerable present-day research being expended on this task of building up set theory on a suitable postulational base. We shall forego this initial part
of  the  program  and  merely  sketch  how  the  natural  numbers  may  be  defined 
in  terms  of  set  concepts. 

It  is  interesting,  as  a  mere  observation  at  this  point,  that  logicians  have 
endeavored  to  push  down  still  further  the  starting  level  of  the  definitional 
development  of  mathematics,  and  to  derive  the  theory  of  sets,  or  classes, 
from  a  foundation  in  the  logic  of  propositions  and  propositional  functions. 
Undoubtedly the most notable attempt of this kind to date is the monumental Principia mathematica of Whitehead and Russell. More will be said
about  this  remarkable  work  in  the  next  chapter. 

We  now  take  up  a  description  of  how  the  natural  numbers  may  be 
introduced  by  means  of  set  concepts.  Some  definitions  are  in  order. 

Definition  1.  Two  sets  A  and  B  are  said  to  be  in  one-to-one  correspondence 
when  we  have  a  pairing  of  the  elements  of  A  with  the  elements  of  B  such 
that  each  element  of  A  corresponds  to  one  and  only  one  element  of  B  and 
each  element  of  B  corresponds  to  one  and  only  one  element  of  A. 

There exists, for example, a one-to-one correspondence between the set of all letters of the alphabet and the set of the first 26 positive integers, for we may make the pairing

    a  b  c  d  ···  x   y   z
    1  2  3  4  ···  24  25  26

There are, of course, many other permissible pairings of these two sets, such as

    a  b  c  d  ···  x   y  z
    3  4  5  6  ···  26  1  2

As another illustration, there exists a one-to-one correspondence between the set of all positive integers and the set of all even positive integers, for we may make the pairing

    1  2  3  4  ···  n   ···
    2  4  6  8  ···  2n  ···

Also,  there  exists  a  one-to-one  correspondence  between  the  set  of  points 
on  a  line  segment  AB  and  the  set  of  points  on  any  other  line  segment  A'B', 
for  we  may  make  the  pairing  suggested  in  Figure  36. 


FIG.  36 

Definition 2. Two sets A and B are said to be equivalent, and we write A ~ B, if and only if they can be placed in one-to-one correspondence.

Thus  the  set  of  all  letters  of  the  alphabet  and  the  set  of  the  first  26 
positive  integers  are  equivalent,  as  are  the  set  of  all  positive  integers  and  the 
set  of  all  even  positive  integers.  It  is  an  easy  matter,  which  we  leave  to  the 
reader,  to  show  that  ~  is  an  equivalence  relation  in  the  technical  sense; 
that  is,  the  relation  ~  is  reflexive,  symmetric,  and  transitive. 
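For finite sets, the two alphabet pairings shown earlier can be written down and tested directly against Definition 1. This is a small sketch under assumptions: a pairing is represented as a Python dict from elements of A to elements of B, and the names is_one_to_one_correspondence, natural_pairing, and shifted_pairing are hypothetical.

```python
import string


def is_one_to_one_correspondence(pairing, set_a, set_b) -> bool:
    """Check that the dict pairs each element of A with exactly one
    element of B, and each element of B with exactly one element of A."""
    values = list(pairing.values())
    return (set(pairing.keys()) == set(set_a)      # every element of A is used
            and set(values) == set(set_b)          # every element of B is reached
            and len(set(values)) == len(values))   # no element of B is used twice


letters = set(string.ascii_lowercase)
first_26 = set(range(1, 27))

# a ↔ 1, b ↔ 2, ..., z ↔ 26
natural_pairing = {ch: i + 1 for i, ch in enumerate(string.ascii_lowercase)}

# a ↔ 3, b ↔ 4, ..., x ↔ 26, y ↔ 1, z ↔ 2
shifted_pairing = {ch: (i + 2) % 26 + 1 for i, ch in enumerate(string.ascii_lowercase)}
```

Both pairings pass the check, which illustrates that equivalence of two sets depends only on the existence of some pairing, not on a particular one.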

We now introduce the concept of number into the theory of sets by the following definition:

Definition 3. Two sets which are equivalent are said to have the same cardinal number. All sets that have the same cardinal number as the set {a} are said to have cardinal number one (or to contain one element); all sets having the same cardinal number as the set {a, a′} are said to have the cardinal number two (or to contain two elements); all sets having the same cardinal number as the set {a, a′, a″} are said to have cardinal number three (or to contain three elements); and so on. We shall denote the cardinal numbers one, two, three, ··· by 1, 2, 3, ···.*

Cardinal  numbers  thus  appear  as  characters  of  sets.  In  other  words, 
a  triad  of  apples  and  a  triad  of  pears  have  a  property  in  common  which  we 
denote  by  “three.”  Gottlob  Frege  in  1879  and  Bertrand  Russell  in  1901, 
as  have  most  other  logicians  who  succeeded  them,  employed  the  idea  of 
Definition  3  in  basic  fashion  when  they  defined  the  cardinal  number  of  a 
set S as the set of all sets equivalent to set S. Thus, according to this definition, three is the set of all sets each of which contains a triad of members.

Definition  4.  A  nonempty  set  is  said  to  be  finite  if  and  only  if  its  cardinal 
number  is  one  of  the  cardinal  numbers  1,  2,  3,  •  •  • .  A  set  which  is  not  empty 
or  finite  is  said  to  be  infinite. 

Definition 5. Let A be a set of cardinal number α and let B be a set of cardinal number β, and let us distinguish the elements of B from those of A so that sets A and B can be considered as having no elements in common; that is, A ∩ B = ∅. Then, by α + β, called the sum of α and β, we mean the cardinal number of the set A ∪ B. This binary operation on cardinal numbers is called addition.

Theorem  1.  Addition  of  cardinal  numbers  is  commutative  and  associative. 

These  properties  follow  immediately  from  the  commutativity  and 
associativity  of  the  union  operation  on  sets. 

Definition 6. The set C whose elements are all ordered pairs (a,b), where a ∈ A and b ∈ B, is called the Cartesian product of A and B, and is denoted by A × B.

Theorem 2. If A, B, C are sets, then A × (B ∪ C) = (A × B) ∪ (A × C).

For let (a,d) be an element of A × (B ∪ C). Then a ∈ A and d ∈ (B ∪ C). Since d ∈ (B ∪ C), it follows that d ∈ B or d ∈ C, whence (a,d) is an element of A × B or of A × C, that is, of (A × B) ∪ (A × C). Conversely, suppose (a,d) is an element of (A × B) ∪ (A × C). Then a ∈ A, and d ∈ B or d ∈ C. Since d ∈ B or d ∈ C, it follows that d ∈ (B ∪ C), whence (a,d) is an element of A × (B ∪ C). It now follows that A × (B ∪ C) = (A × B) ∪ (A × C).
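The identity of Theorem 2 can be observed on small finite sets. The sketch below is only an illustration; the sets A, B, C and the helper cartesian are made up for the example, and B and C are deliberately chosen to overlap, since the theorem does not require them to be disjoint.

```python
from itertools import product


def cartesian(a, b):
    """Return the Cartesian product of two sets as a set of ordered pairs."""
    return set(product(a, b))


A = {1, 2}
B = {"x", "y"}
C = {"y", "z"}

left = cartesian(A, B | C)                     # A × (B ∪ C)
right = cartesian(A, B) | cartesian(A, C)      # (A × B) ∪ (A × C)
```

Here left and right are the same set of six ordered pairs.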

*This definition presumes ability to recognize distinct objects.

Definition 7. Let A be a set of cardinal number α and let B be a set of cardinal number β. Then, by αβ, called the product of α and β, we mean the cardinal number of the set A × B. This binary operation on cardinal numbers is called multiplication.

Theorem 3. Multiplication of cardinal numbers is commutative, associative, and distributive with respect to addition of cardinal numbers.

The commutative property holds, since the correspondence (a,b) ↔ (b,a), where a ∈ A and b ∈ B, shows that A × B ~ B × A. The associative property holds, since the correspondence (a,(b,c)) ↔ ((a,b),c), where a ∈ A, b ∈ B, and c ∈ C, shows that A × (B × C) ~ (A × B) × C. Finally, the distributive property follows from Theorem 2, for if B and C have no elements in common and if α, β, γ are the cardinal numbers of A, B, C, then the cardinal number of A × (B ∪ C) is α(β + γ), and the cardinal number of (A × B) ∪ (A × C) is αβ + αγ.

Theorem 4. For any cardinal number α, 1α = α.

The  proof  of  this  theorem  is  quite  trivial. 
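For finite sets, Definitions 5 and 7 and Theorems 1, 3, and 4 can be tried out by counting. The sketch below makes one assumption worth stating: to "distinguish the elements of B from those of A," each element is tagged with 0 or 1 before the union is formed. The function names are hypothetical.

```python
from itertools import product


def disjoint_union(a, b):
    """Union of a and b after tagging, so that no elements are shared."""
    return {(0, x) for x in a} | {(1, y) for y in b}


def cardinal_sum(a, b) -> int:
    """Definition 5: the cardinal number of the disjoint union."""
    return len(disjoint_union(a, b))


def cardinal_product(a, b) -> int:
    """Definition 7: the cardinal number of the Cartesian product."""
    return len(set(product(a, b)))


A = {1, 2}
B = {2, 3, 4}      # shares the element 2 with A on purpose
C = {5}
```

With these sets, cardinal_sum(A, B) is 5 even though A ∪ B has only four elements, which shows why the definition insists on first making the sets disjoint. The product of A with a one-element set has as many elements as A, in agreement with Theorem 4.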

Theorems  1,  3,  and  4  hold  for  the  cardinal  numbers  of  both  finite  and 
infinite  sets.  These  theorems  cover  the  properties  of  Postulates  N1  through 
N6  for  the  natural  number  system  as  given  in  Section  7.3.  The  properties 
of  Postulates  N7  through  N10  do  not  hold  for  the  cardinal  numbers  of 
infinite  sets  but  can  be  shown  to  hold  for  those  of  finite  sets,  though  we  shall 
not  undertake  this  task.  It  follows  that  the  cardinal  numbers  1,  2,  3,  •  •  •, 
under  the  above  operations  of  addition  and  multiplication,  satisfy  all  the 
postulates  for  the  natural  number  system.  Since  (assuming  consistency  of 
the  postulate  set)  any  two  possible  interpretations  of  this  postulate  set  must 
be  isomorphic,  we  may  identify  the  set  of  cardinal  numbers  1,  2,  3,  •  •  •  with 
the  set  of  natural  numbers.* 

*For a development of the real number system via the cardinal numbers of set theory, written at the college freshman level, see H. Levi [1]; also see N. J. Lennes. For a very compact postulational treatment starting with set theory, see W. J. Thron, Sections 1 and 2.

8.4  Infinite Sets and Transfinite Numbers

In the last section we saw that the cardinal numbers of finite sets may be identified with the natural numbers. In this section we consider some of the properties of the cardinal numbers of infinite sets. These cardinal numbers are known as transfinite numbers, and their theory was first developed by Georg Cantor in a remarkable series of articles beginning in 1872, and published, for the most part, in the German mathematics journals Mathematische Annalen and Journal für Mathematik. Prior to Cantor’s study mathematicians accepted only one infinity, denoted by some symbol like ∞, and this symbol was employed indiscriminately to indicate the “number” of elements in such sets as the set of all natural numbers and the set of all real numbers. With Cantor’s work, a whole new outlook was introduced, and a scale and arithmetic of infinities was achieved.

The  basic  principle  that  equivalent  sets  are  to  bear  the  same  cardinal 
number  presents  us  with  many  interesting  and  intriguing  situations  when 
the  sets  under  consideration  are  infinite  sets.  Galileo  Galilei  observed  as 
early as the latter part of the sixteenth century that the set of all positive integers can be placed in one-to-one correspondence with the set of all positive even integers, as was depicted in the last section. Hence, the same cardinal number should be assigned to each of these sets, and, from this point of view, we must say that there are as many even positive integers as there are positive integers in all! It is observed at once that the Euclidean postulate
which  states  that  the  whole  is  greater  than  a  part  cannot  be  tolerated  when 
cardinal  numbers  of  infinite  sets  are  under  consideration.  In  fact,  Dedekind, 
in  about  1888,  actually  defined  an  infinite  set  to  be  one  which  is  equivalent 
to  some  proper  subset  of  itself.  The  simplicity  of  applying  this  definition  in 
many  circumstances  is  illustrated  by  Galileo’s  observation,  namely,  that  the 
set  of  all  positive  integers  is  equivalent  to  a  proper  subset  of  itself.  It  can  be 
shown*  that  this  definition  of  Dedekind’s  is  equivalent  to  the  definition  of 
infinite  set  given  in  Section  8.3. 

We shall designate the cardinal number of the set of all natural numbers by d,† and describe any set having this cardinal number as being denumerable. It follows that a set S is denumerable if and only if its elements can be written as an unending sequence {s₁, s₂, s₃, ···}. Since it is easily shown‡ that any infinite set contains a denumerable subset, it follows that d is the “smallest” transfinite number.

*By employing the axiom of choice. See the Appendix, Section A.5.

†Cantor designated this cardinal number by the Hebrew letter aleph with the subscript zero, that is, by ℵ₀.

‡Again, by employing the axiom of choice. See the Appendix, Section A.5.

Cantor, in one of his earliest papers on set theory, proved the denumerability of two important sets which scarcely seem at first glance to possess this property.

The first set is the set of all rational numbers. This set has the important property of being dense. By this is meant that between any two distinct rational numbers there exists another rational number, in fact, infinitely many other rational numbers. For example, between 0 and 1 lie the rational numbers

    1/2, 2/3, 3/4, 4/5, 5/6, ···, n/(n + 1), ···;

between 0 and 1/2 lie the rational numbers

    1/3, 2/5, 3/7, 4/9, 5/11, ···, n/(2n + 1), ···;

between 0 and 1/4 lie the rational numbers

    1/5, 2/9, 3/13, 4/17, 5/21, ···, n/(4n + 1), ···;

and so on. Because of this property one might well expect the transfinite number of the set of all rational numbers to be greater than d.* Cantor showed that this is not the case, and that, on the contrary, the set of all rational numbers is denumerable. His proof is interesting and runs as follows.

Theorem 1. The set of all rational numbers is denumerable.

Consider the array

     1  →  2     3  →  4   ···
        ↙     ↗     ↙
    1/2   2/2   3/2   4/2  ···
     ↓  ↗     ↙
    1/3   2/3   3/3   4/3  ···
        ↙
    1/4   2/4   3/4   4/4  ···
     ↓
    ···

in which the first row contains, in order of magnitude, all the natural numbers (that is, all positive fractions with denominator 1), the second row contains, in order of magnitude, all the positive fractions with denominator 2, the third row contains, in order of magnitude, all the positive fractions with denominator 3, etc. Obviously, every positive rational number appears in this array, and if we list the numbers in the order of succession indicated by the broken line, omitting numbers which have already appeared, we obtain an unending sequence

    1, 2, 1/2, 1/3, 3, 4, 3/2, 2/3, 1/4, ···

in which each positive rational number appears once and only once. Denote this sequence by {r₁, r₂, r₃, ···}. Then the sequence {0, −r₁, r₁, −r₂, r₂, ···} contains the set of all rational numbers, and the denumerability of this set is established.

*The cardinal number of a set A is said to be greater than the cardinal number of a set B if and only if B is equivalent to a proper subset of A, but A is equivalent to no proper subset of B.
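The listing used in this proof can be generated step by step. The sketch below is one way to follow the broken line through the array, under the assumption that the path runs along the diagonals on which numerator plus denominator is constant, alternating direction from one diagonal to the next; the generator names are hypothetical, and Python's fractions.Fraction is used so that repeated values such as 2/2 are recognized and skipped.

```python
from fractions import Fraction
from itertools import islice


def positive_rationals():
    """Yield each positive rational once, in the order of the broken line."""
    seen = set()
    s = 2                      # s = numerator + denominator on the current diagonal
    while True:
        # Even diagonals are walked with rising numerator, odd ones with falling.
        numerators = range(1, s) if s % 2 == 0 else range(s - 1, 0, -1)
        for p in numerators:
            r = Fraction(p, s - p)
            if r not in seen:  # omit numbers which have already appeared
                seen.add(r)
                yield r
        s += 1


def all_rationals():
    """Yield 0, −r1, r1, −r2, r2, ... as in the last step of the proof."""
    yield Fraction(0)
    for r in positive_rationals():
        yield -r
        yield r
```

For instance, list(islice(positive_rationals(), 9)) gives the nine terms 1, 2, 1/2, 1/3, 3, 4, 3/2, 2/3, 1/4 displayed in the proof.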

The  second  set  considered  by  Cantor  is  a  seemingly  much  more  extensive 
set  of  numbers  than  the  set  of  rational  numbers.  We  first  make  the  following 
definition. 

Definition 1. A complex number is said to be algebraic if it is a zero of some polynomial

    f(x) = a₀xⁿ + a₁xⁿ⁻¹ + ··· + aₙ₋₁x + aₙ,

where n is a non-negative integer, a₀ ≠ 0, and all the aᵢ’s are integers. A complex number which is not algebraic is said to be transcendental.

It  is  quite  clear  that  the  algebraic  numbers  include,  among  others,  all 
rational  numbers  and  all  roots  of  such  numbers.  Accordingly,  the  following 
theorem  is  somewhat  astonishing: 


Theorem  2.  The  set  of  all  algebraic  numbers  is  denumerable. 

Let  /(*)  by  a  polynomial  of  the  kind  described  in  Definition  1,  where, 
without  loss  of  generality,  we  may  suppose  <20  >  0.  Consider  the  so-called 
height  of  the  polynomial,  defined  by 


h  =  n  T  <20  +  a  1  -f-  1(22  + 


T  an- 1  “b 


Obviously  h  is  an  integer  ^  1,  and  there  are  plainly  only  a  finite  number  of 
polynomials  of  a  given  height  /?,  and  therefore  only  a  finite  number  of 
algebraic  numbers  arising  from  polynomials  of  a  given  height  h.  We  may 
now  list  (theoretically  speaking)  all  the  algebraic  numbers,  refraining  from 
repeating  any  number  already  listed,  by  first  taking  those  arising  from 
polynomials  of  height  1,  then  those  arising  from  polynomials  of  height  2, 
then  those  arising  from  polynomials  of  height  3,  and  so  on.  We  thus  see 
that  the  set  of  all  algebraic  numbers  can  be  listed  in  an  unending  sequence, 
whence  the  set  is  denumerable. 
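
The finiteness claim on which this proof rests can be checked by brute force for small heights. The sketch below is an illustration only; it takes the height to be n plus the leading coefficient plus the absolute values of the remaining coefficients, with the leading coefficient positive, and the function name is our own. It lists the coefficient tuples $(a_0, \ldots, a_n)$ of every polynomial of a given height.

```python
from itertools import product

def polynomials_of_height(h):
    """Return all tuples (a_0, ..., a_n) with a_0 > 0 and
    n + a_0 + |a_1| + ... + |a_n| == h."""
    result = []
    for n in range(h):                 # a_0 >= 1 forces degree n <= h - 1
        budget = h - n                 # what remains for a_0 + |a_1| + ... + |a_n|
        for coeffs in product(range(-budget, budget + 1), repeat=n + 1):
            if coeffs[0] > 0 and sum(abs(c) for c in coeffs) == budget:
                result.append(coeffs)
    return result

counts = [len(polynomials_of_height(h)) for h in range(1, 5)]
```

For heights 1 through 4 the counts are 1, 2, 5, and 12. Since a polynomial of degree n has at most n zeros, each height contributes only finitely many algebraic numbers, which is all the listing argument requires.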

In  view  of  the  past  two  theorems,  there  remains  the  possibility  that  all 
infinite  sets  are  denumerable.  That  this  is  not  so  was  shown  by  Cantor  in 
a  striking  proof  of  the  following  significant  theorem: 

Theorem  3.  The  set  of  all  real  numbers  in  the  interval  0  <  x  <  1  is 
nondenumerable. 

The  proof  is  indirect  and  employs  an  unusual  method  known  as  the 
Cantor  diagonal  process.  Let  us,  then,  assume  the  set  to  be  denumerable.  Then 
we may list the numbers of the set in a sequence $\{p_1, p_2, p_3, \ldots\}$. Each of
these numbers $p_i$ can be written uniquely as a nonterminating decimal
fraction; in this connection it is useful to recall that every rational number
may be written as a “repeating decimal”; a number such as 0.3, for example,
can be written as 0.29999.... We can then display the sequence in the
following array,

$$\begin{aligned}
p_1 &= 0.a_{11}a_{12}a_{13}\cdots \\
p_2 &= 0.a_{21}a_{22}a_{23}\cdots \\
p_3 &= 0.a_{31}a_{32}a_{33}\cdots
\end{aligned}$$

where each symbol $a_{ij}$ represents some one of the digits 0, 1, 2, 3, 4, 5, 6, 7,
8, 9. Now, in spite of any care which has been taken to list all the real
numbers between 0 and 1, there is a number which could not have been
listed. Such a number is $0.b_1b_2b_3\cdots$, where, say, $b_k = 7$ if $a_{kk} \neq 7$ and
$b_k = 3$ if $a_{kk} = 7$, for $k = 1, 2, 3, \ldots, n, \ldots$. This number clearly lies
between 0 and 1, and it must differ from each number $p_i$, for it differs from
$p_1$ in at least the first decimal place, from $p_2$ in at least the second decimal
place, from $p_3$ in at least the third decimal place, and so on. Thus the
original  assumption  that  all  the  real  numbers  between  0  and  1  can  be  listed 
in  a  sequence  is  untenable,  and  the  set  must  therefore  be  nondenumerable. 
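
The diagonal construction can be tried out on any finite portion of a proposed list. The following Python sketch is an illustration under stated assumptions: each number is represented by a list of its leading decimal digits, and the sample digits and the function name are hypothetical.

```python
def diagonal_digits(rows):
    """Return digits b_1, ..., b_m with b_k = 7 if a_kk != 7 and b_k = 3 otherwise."""
    return [3 if row[k] == 7 else 7 for k, row in enumerate(rows)]

# Hypothetical leading digits of p_1, ..., p_4 (row k holds a_k1, a_k2, ...).
rows = [
    [1, 4, 1, 5],
    [7, 7, 7, 7],
    [0, 0, 7, 0],
    [9, 9, 9, 7],
]
b = diagonal_digits(rows)

# The constructed number differs from p_k in the k-th decimal place.
differs = all(b[k] != rows[k][k] for k in range(len(rows)))
```

A finite check of this kind proves nothing about the infinite list, of course; it merely shows the rule at work.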

In  reference  to  the  above  proof,  it  should  be  observed  that  the  Cantor 
diagonal process has not received the universal endorsement of mathematicians,
and some severe indictments of the process from the operational
point  of  view  have  been  made.  Nevertheless,  the  ingenious  method  has 
become  a  model  for  many  mathematical  demonstrations. 

Cantor  deduced  the  following  remarkable  consequence  of  Theorems 
2  and  3: 

Theorem  4.  Transcendental  numbers  exist. 

Since,  by  Theorem  3,  the  set  of  all  real  numbers  between  0  and  1  is 
nondenumerable, it is easily demonstrated that the set of all complex numbers
is also nondenumerable. But, by Theorem 2, the set of all algebraic
numbers  is  denumerable.  It  follows  that  there  must  exist  complex  numbers 
which  are  not  algebraic,  and  the  theorem  is  established. 

Not  all  mathematicians  are  willing  to  accept  the  above  proof  of  Theorem 
4.  The  acceptability  or  nonacceptability  of  the  proof  hinges  on  what  one 
believes  mathematical  existence  to  be,  and  there  are  some  mathematicians 
who  feel  that  mathematical  existence  is  established  only  when  one  of  the 
objects  whose  existence  is  in  question  is  actually  constructed  and  exhibited. 
Now  the  above  proof  does  not  establish  the  existence  of  transcendental 
numbers  by  producing  a  specific  example  of  such  a  number.  There  are  many 
existence  proofs  in  mathematics  of  this  nonconstructive  sort,  where  existence 
is presumably established by merely showing that the assumption of nonexistence
leads to a contradiction. Most proofs of the fundamental theorem
of  algebra,  for  example,  are  formulated  along  such  lines. 

Because of the dissatisfaction of some mathematicians with nonconstructive
existence proofs, a good deal of effort has been made to replace
such proofs by those which actually yield one of the objects concerned. In
the Appendix, Section A.4, is a proof of this kind of the existence of transcendental
numbers.

The  proof  of  the  existence  of  transcendental  numbers  and  the  proof  that 
some  particular  number  is  transcendental  are  two  quite  different  matters, 
the  latter  often  being  a  very  difficult  problem.  It  was  Hermite  who,  in  1873, 
proved that the number e, the base for natural logarithms, is transcendental,
and Lindemann, in 1882, who first established the transcendentality of the
number $\pi$. Unfortunately, it is inconvenient for us to prove these interesting
facts here. The difficulty of identifying a particular given number as algebraic
or transcendental is illustrated by the fact that it is not yet known whether
the number $\pi^\pi$ is algebraic or transcendental. A recent gain along these lines
was the establishment of the transcendental character of any number of the
form $a^b$, where a is an algebraic number different from 0 or 1, and b is any
irrational algebraic number. This result was a culmination of an almost
thirty-year effort to prove that the so-called Hilbert number, $2^{\sqrt{2}}$, is
transcendental.

Since the set of all real numbers in the interval $0 < x < 1$ is
nondenumerable, the transfinite number of this set is greater than d. We shall
denote  it  by  c,  and  shall  refer  to  it  as  the  cardinal  number  of  the  continuum. 
It  has  been  generally  believed  that  c  is  the  next  transfinite  number  after  d , 
that  is,  that  there  is  no  set  having  a  cardinal  number  greater  than  d  but 
less  than  c.  This  belief  is  known  as  the  continuum  hypothesis ,  but,  in  spite  of 
the  most  strenuous  efforts,  no  proof  has  been  found  to  establish  it.  Many 
consequences  of  the  hypothesis  have  been  deduced,  and,  in  about  1940,  the 
Austrian logician Kurt Gödel succeeded in showing that the continuum
hypothesis is consistent with a famous postulate set of set theory provided
these postulates themselves are consistent.* Gödel conjectured that the denial
of the continuum hypothesis is also consistent with the postulates of set theory.
This conjecture was established, in 1963, by Dr. Paul J. Cohen of Stanford
University, thus proving that the continuum hypothesis is independent of
the postulates of set theory, and hence can never be deduced from those
postulates.† The situation is analogous to that of the parallel postulate in
Euclidean  geometry. 

It  has  been  shown  that  the  set  of  all  single-valued  functions  f(x)  defined 
over  the  interval  0  <  x  <  1  has  a  cardinal  number  greater  than  c,  but 
whether this cardinal number is or is not the next after c is not known.
Cantor’s  theory  provides  for  an  infinite  sequence  of  transfinite  numbers, 
and  there  are  demonstrations  which  purport  to  show  that  an  unlimited 
number  of  cardinal  numbers  greater  than  that  of  the  continuum  actually 
exist. 

*K. Gödel [3].

†P. J. Cohen.

8.5 Sets and the Fundamental Concepts of Mathematics

Following  the  work  of  Cantor,  interest  in  the  theory  of  sets  developed 
rapidly  until  today  virtually  every  field  of  mathematics  has  felt  the  impact 
of  the  new  discipline.  Notions  of  space  and  the  geometry  of  a  space,  for 
example,  have  been  completely  revolutionized  by  the  theory  of  sets.  Also, 
the  basic  concepts  in  analysis,  such  as  those  of  limit,  function,  continuity, 
derivative,  and  integral,  are  now  most  aptly  described  in  terms  of  set  theory 
ideas. Most important, however, has been the opportunity for new mathematical
developments undreamed of fifty years ago. Thus, in companionship
with  the  new  appreciation  of  postulational  procedures  in  mathematics, 
abstract  spaces  have  been  born,  general  theories  of  dimension  and  measure 
have  been  created,  and  the  branch  of  mathematics  called  topology  has 
undergone  a  spectacular  growth.  In  short,  under  the  influence  of  set  theory, 
a  considerable  unification  of  traditional  mathematics  has  occurred,  and  new 
mathematics  has  been  created  at  an  almost  explosive  rate. 

It  is  the  purpose  of  the  present  section  to  expand  on  the  above  remarks 
and,  in  particular,  to  show  how  the  theory  of  sets  has  clarified  and  generalized 
some  of  the  fundamental  concepts  of  mathematics. 

Let  us  first  consider  some  notions  of  space  and  the  geometry  of  a  space. 
These  concepts  have  undergone  marked  changes  since  the  days  of  the  ancient 
Greeks.  For  the  Greeks  there  was  only  one  space  and  one  geometry;  these 
were  absolute  concepts.  The  space  was  not  thought  of  as  a  collection  of 
points,  but  rather  as  a  realm,  or  locus,  in  which  objects  could  be  freely  moved 
about  and  compared  with  one  another.  From  this  point  of  view,  the  basic 
relation  in  geometry  was  that  of  congruence  or  superposability. 

With  the  development  of  analytic  geometry,  space  came  to  be  regarded 
as  a  collection  of  points,  and  with  the  invention  of  the  classical  non-Euclidean 
geometries,  mathematicians  accepted  the  situation  that  there  is  more  than 
one  conceivable  space  and  hence  more  than  one  geometry.  But  space  was 
still  regarded  as  a  locus  in  which  figures  could  be  compared  with  one  another. 
The  central  idea  became  that  of  a  group  of  congruent  transformations  of 
space  into  itself,  and  a  geometry  came  to  be  regarded  as  the  study  of  those 
properties  of  configurations  of  points  which  remain  unchanged  when  the 
enclosing  space  is  subjected  to  these  transformations.  We  have  seen,  in 
Section  5.4,  how  this  point  of  view  was  expanded  by  Felix  Klein  in  his 
Erlanger  Programm  of  1872.  In  the  Erlanger  Programm,  a  geometry  was 
defined  as  the  invariant  theory  of  a  transformation  group.  This  concept 
synthesized  and  generalized  all  earlier  concepts  of  geometry,  and  supplied 
a  singularly  neat  classification  of  a  large  number  of  important  geometries. 

At the end of the last century there was developed the idea of a branch
of mathematics as an abstract body of theorems deduced from a set of
postulates, and each geometry became, from this point of view, a particular
branch  of  mathematics.  Postulate  sets  for  a  large  variety  of  geometries  were 
studied,  but  the  Erlanger  Programm  was  in  no  way  upset,  for  a  geometry 
could  be  regarded  as  a  branch  of  mathematics  which  is  the  invariant  theory 
of  a  transformation  group. 

In 1906, however, Maurice Fréchet inaugurated the study of abstract
spaces, and very general geometries came into being which no longer necessarily
fit into the neat Kleinian classification. A space became merely a set
of  objects,  usually  called  points,  together  with  a  set  of  relations  in  which 
these  points  are  involved,  and  a  geometry  became  simply  the  theory  of  such 
a  space.  The  set  of  relations  to  which  the  points  are  subjected  is  called  the 
structure  of  the  space,  and  this  structure  may  or  may  not  be  explainable  in 
terms  of  the  invariant  theory  of  a  transformation  group.  Through  set  theory, 
then,  geometry  received  a  further  generalization.  Although  abstract  spaces 
were  first  formally  introduced  in  1906,  the  idea  of  a  geometry  as  the  study 
of a set of points with some superimposed structure was really already contained
in remarks made by Riemann in his famous lecture of 1854. It is
interesting  that  some  of  these  new  geometries  have  found  valuable  application 
in  the  Einstein  theory  of  relativity,  and  in  other  developments  of  modern 
physics. 

Let  us  illustrate  the  notion  of  abstract  spaces  by  briefly  considering, 
in  turn,  a  Hausdorff  space,  a  metric  space,  and  a  topological  space.  We 
will  find  that  in  the  study  of  these  spaces  many  of  the  basic  concepts  of 
analysis  have  reached  a  remarkable  degree  of  generalization.  Since  the 
subject  of  abstract  spaces  is  based  upon  set  theory  ideas,  we  shall  freely 
employ  the  concepts  and  notations  of  set  theory  that  have  already  been 
developed  in  earlier  sections. 

Definition 1. A Hausdorff space is a set H of elements, called points,
together with a collection of certain subsets of these points, called neighborhoods,
satisfying the following four postulates:

H1: For each point x of H there corresponds at least one neighborhood $N_x$,
where the symbol $N_x$ means that $x \in N_x$.

H2: For any two neighborhoods $N_x$ and $N'_x$ of x, there exists a third neighborhood
$N''_x$ such that $N''_x \subset (N_x \cap N'_x)$.

H3: If y is a point of H such that $y \in N_x$, then there is a neighborhood $N_y$
of y such that $N_y \subset N_x$.

H4: If $x \neq y$, there exists an $N_x$ and an $N_y$ such that $N_x \cap N_y = \emptyset$.

To  assist  in  understanding  the  postulates  for  a  Hausdorff  space,  let  H 
be  the  set  of  all  points  on  a  straight  line,  and  select  for  the  neighborhoods 
of  a  point  x  of  H  the  segments  of  the  straight  line  which  have  x  as  midpoint. 
It  is  easy  to  see  that  the  above  postulates  are  satisfied  by  this  interpretation 
of points and neighborhoods, and thus we have an example of a Hausdorff
space. The arithmetical counterpart of this Hausdorff space is very important
in the study of analysis.

Within  a  Hausdorff  space  it  is  possible  to  define  the  concept  of  limit 
point  as  follows.  A  point  x  of  H  will  be  called  a  limit  point  of  a  subset  S  of 
H  provided  every  neighborhood  of  x  contains  at  least  one  point  of  S  distinct 
from x. There are some interesting consequences of this definition when
taken  in  conjunction  with  the  space  given  above.  We  have,  for  example, 
the  following  theorem. 

Theorem 1. Any neighborhood $N_x$ of a limit point x of a set S contains an
infinite number of points of S.

Since x is a limit point of S, any neighborhood $N_x$ of x contains a point
$y_1$ of S, where $y_1 \neq x$. By Postulate H4, there then exist neighborhoods
$N_{y_1}$ of $y_1$ and $N'_x$ of x such that $N'_x \cap N_{y_1} = \emptyset$. Again, by Postulate H2,
there exists a neighborhood $N''_x$ of x such that $N''_x \subset (N_x \cap N'_x)$. It follows
that $y_1 \notin N''_x$. But, since x is a limit point of S, $N''_x$, and hence $N_x$, contains
a point $y_2$ of S, where $y_2 \neq x$ and $y_2 \neq y_1$. Continuing in this way, we find that
$N_x$ contains an infinite sequence of distinct points $y_1, y_2, y_3, \ldots$ of S, and
the theorem is established.

This  theorem  brings  out  in  striking  fashion  the  fact  that  the  concept  of 
limit  point  is  applicable  only  to  infinite  sets.  Not  every  infinite  subset  of  the 
points  of  a  Hausdorff  space  necessarily  has  a  limit  point,  but  a  theorem 
that  we  owe  to  Weierstrass  states  that  if  H  is  the  set  of  all  points  on  a  straight 
line,  and  if  by  neighborhood  of  a  point  x  of  H  we  mean  any  segment  of  the 
straight  line  which  has  x  as  midpoint,  then  any  infinite  subset  of  points  of  H 
lying  within  a  finite  segment  of  H  has  at  least  one  limit  point.  The  arithmetized 
version  of  this  theorem  is  very  fundamental  in  analysis,  and  the  theorem  is 
readily  extended  to  spaces  other  than  the  set  of  points  upon  a  straight  line. 

Among  the  various  concepts  related  to  a  Hausdorff  space,  we  might 
mention  the  following.  A  subset  S  of  H  is  said  to  be  compact  if  every  infinite 
subset  of  points  of  S  possesses  a  limit  point  which  belongs  to  S.  If  S  is  a 
subset of H, by the closure of S, written $\bar{S}$, is meant the set consisting of all
points of S and all limit points of S. A subset S of H is said to be separable
provided it contains a finite or a denumerable subset E such that $S \subset \bar{E}$.
A  subset  S  of  H  is  said  to  be  connected  provided  that  however  S  may  be  divided 
into  nonempty  subsets,  one  of  these  subsets  will  contain  a  limit  point  of 
the  other.  A  subset  S  of  H  which  is  both  compact  and  connected  is  called 
a continuum. If, in any of the above definitions, $S = H$, then the whole
Hausdorff space is said to possess the corresponding property. All of these
properties are generalizations of analogous properties possessed by the set
of all real numbers x of a closed number interval I, where by a neighborhood
of x is meant any number interval lying within I and containing x.

In many of the familiar geometries, such as Euclidean geometry, there
is the notion of distance between two points. This idea has been generalized
by set theory into the study of so-called metric spaces. These are the abstract
spaces that Fréchet introduced in 1906.

Definition 2. A metric space is a set M of elements, called points, together
with a real number $p(x,y)$, called the distance function or metric of the space,
associated with each pair of points x and y of M, satisfying the following
two postulates:

M1: $p(x,y) = 0$ if and only if $x = y$.

M2: $p(x,y) \leq p(y,z) + p(z,x)$, where x, y, z are any three, not necessarily
distinct, points of M. (This is referred to as the triangle inequality.)

Some  important  examples  of  metric  spaces*  are: 

1.  The  set  of  all  real  numbers  in  which  the  distance  function  is  defined 
by $p(x,y) = |x - y|$. A simple geometric interpretation of this space is
the  set  of  all  points  on  a  straight  line,  with  the  ordinary  concept  of 
distance  between  two  points. 

2. The set of all ordered pairs of real numbers in which the distance
function is defined by

$$p(p_1,p_2) = [(x_1 - x_2)^2 + (y_1 - y_2)^2]^{1/2},$$

where $p_1 = (x_1,y_1)$, $p_2 = (x_2,y_2)$.
The student of plane analytic geometry will recognize as a geometric
interpretation here the set of all points in the plane, together with the
ordinary concept of distance between two points.

3. The set of all ordered pairs of real numbers in which the distance
function is defined by

$$p(p_1,p_2) = |x_2 - x_1| + |y_2 - y_1|,$$

where $p_1 = (x_1,y_1)$, $p_2 = (x_2,y_2)$.
By plotting on the Cartesian plane, the reader will readily see why this
space is frequently referred to as taxi-cab space.

4. The set of all infinite sequences $x = \{x_1, x_2, \ldots\}$ of real numbers,
for which the infinite series $\sum_{i=1}^{\infty} x_i^2$ is convergent, in which the distance
function is defined by

$$p(x,y) = \left[\sum_{i=1}^{\infty} (x_i - y_i)^2\right]^{1/2}.$$

This example of a metric space is known as Hilbert space.
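
The two postulates can be tested numerically on a handful of sample points. The Python sketch below is an illustration only, not a proof: it encodes the distance functions of Examples 2 and 3 and checks M1 and M2 over every pair and triple drawn from a small sample. The helper `satisfies_postulates`, the tolerance, and the sample points are our own assumptions.

```python
import math
from itertools import product

def euclidean(p1, p2):
    """Distance function of Example 2."""
    return math.hypot(p1[0] - p2[0], p1[1] - p2[1])

def taxicab(p1, p2):
    """Distance function of Example 3 (taxi-cab space)."""
    return abs(p2[0] - p1[0]) + abs(p2[1] - p1[1])

def satisfies_postulates(metric, points, tol=1e-12):
    """Check M1 and M2 on a finite sample of points."""
    for x, y in product(points, repeat=2):
        if (metric(x, y) == 0) != (x == y):                # M1
            return False
    for x, y, z in product(points, repeat=3):
        if metric(x, y) > metric(y, z) + metric(z, x) + tol:   # M2
            return False
    return True

sample = [(0, 0), (3, 4), (-1, 2), (5, -2)]
euclidean_ok = satisfies_postulates(euclidean, sample)
taxicab_ok = satisfies_postulates(taxicab, sample)

# For contrast: the squared gap on the real line violates M2.
squared_gap_ok = satisfies_postulates(lambda x, y: (x - y) ** 2, [0, 1, 2])
```

Both distance functions pass on the sample, while the squared gap fails because 4 exceeds 1 + 1 for the points 0, 1, 2.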


*We shall not prove that the spaces are metric spaces. A proof is easy for (1) and (3)
but is somewhat difficult for (2) and (4).

We shall now list several theorems about metric spaces.

Theorem 2. $p(x,y) = p(y,x)$.

By Postulate M2 we have

$$p(x,y) \leq p(y,z) + p(z,x)$$

and, by interchanging x and y,

$$p(y,x) \leq p(x,z) + p(z,y).$$

Setting $z = x$ in the first of these inequalities, and $z = y$ in the second one,
we find (recalling Postulate M1)

$$p(x,y) \leq p(y,x), \qquad p(y,x) \leq p(x,y).$$

It now follows that $p(x,y) = p(y,x)$.

Theorem 3. $p(x,y) \geq 0$.

In M2, replace y by x and z by y. Then, since $0 = p(x,x)$ by M1,
$0 \leq p(x,y) + p(y,x) = 2p(x,y)$ by Theorem 2. Hence $p(x,y) \geq 0$.

Theorem  4.  Any  subset  of  a  metric  space  is,  for  the  same  metric ,  a  metric  space. 

Theorem 5. Any metric space can be made into a Hausdorff space.

Let  x  be  any  point  of  a  metric  space  M,  and  consider  the  subset  of  all 
points  y  of  M  such  that  p(x,y)  <  r,  where  r  is  a  positive  real  number.  Such 
a subset, which is nonempty inasmuch as it necessarily contains x, will be
called  an  open  sphere  and  will  be  denoted  by  S(x,r).  We  now  show  that  these 
open  spheres,  if  regarded  as  neighborhoods,  satisfy  the  four  postulates  of  a 
Hausdorff  space. 

We  omit  the  obvious  verification  of  HI  and  H2. 

To verify H3, let $p(x,y) < r$, and put $R = r - p(x,y) > 0$. The triangle
inequality states that $p(x,y') \leq p(x,y) + p(y,y') = (r - R) + p(y,y') < r$,
if $p(y,y') < R$. That is, $S(y,R) \subset S(x,r)$.

To verify H4, let x be distinct from y and set $r = p(x,y) > 0$. Then
the two open spheres $S(x, r/3)$ and $S(y, r/3)$ have no
points in common, for a common point z would give
$r = p(x,y) \leq p(y,z) + p(z,x) < r/3 + r/3 < r$, which is impossible.
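
The verification of H4 can be watched at work on a finite sample. The following Python sketch is an illustration under our own assumptions: the metric is the taxi-cab distance, the points are a small integer grid, and the names `open_sphere` and `h4_holds` are hypothetical. It forms the open spheres of radius r/3 about every pair of distinct points and confirms that they are disjoint.

```python
from itertools import combinations

def taxicab(p1, p2):
    """Taxi-cab distance between two points of the plane."""
    return abs(p2[0] - p1[0]) + abs(p2[1] - p1[1])

def open_sphere(points, metric, center, radius):
    """The points of the sample lying in the open sphere S(center, radius)."""
    return {q for q in points if metric(center, q) < radius}

def h4_holds(points, metric):
    """Check that S(x, r/3) and S(y, r/3) are disjoint, where r = p(x, y)."""
    for x, y in combinations(points, 2):
        r = metric(x, y)
        if open_sphere(points, metric, x, r / 3) & open_sphere(points, metric, y, r / 3):
            return False
    return True

grid = [(i, j) for i in range(-3, 4) for j in range(-3, 4)]
unit_ball = open_sphere(grid, taxicab, (0, 0), 2)
separated = h4_holds(grid, taxicab)
```

On this grid the open sphere of radius 2 about the origin consists of the origin and its four nearest neighbors.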

The  important  inverse  problem,  namely,  the  discussion  regarding  to 
what  extent  any  Hausdorff  space  can  be  made  into  a  metric  space,  cannot 
be  taken  up  here,  and  the  interested  reader  must  consult  a  more  exhaustive 
treatment  of  the  subject. 

Some  interesting  definitions  can  be  introduced  in  connection  with  metric 
spaces. Thus, to define a limit point x of a subset S of M by the use of the
metric $p(x,y)$, it is sufficient to state that for any given positive number $\epsilon$
there exists a point y of S such that $y \neq x$ and $p(x,y) < \epsilon$. Again, the diameter
of a subset S of M may be defined as the least upper bound of $p(x,y)$, where
x and y are points in S. Similarly, the distance between two subsets $S_1$ and $S_2$
of M may be defined as the greatest lower bound of $p(x,y)$, where x is a
point in $S_1$ and y is a point in $S_2$.
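
For finite subsets the least upper bound and the greatest lower bound are simply a maximum and a minimum, so the two definitions can be computed directly. The sketch below is an illustration restricted to that finite case; the function names and sample sets are our own, and the metric is that of Example 1.

```python
def absolute(x, y):
    """Distance function |x - y| on the real numbers."""
    return abs(x - y)

def diameter(subset, metric):
    """Largest distance between two points of a finite subset."""
    return max(metric(x, y) for x in subset for y in subset)

def set_distance(s1, s2, metric):
    """Smallest distance from a point of s1 to a point of s2 (finite sets)."""
    return min(metric(x, y) for x in s1 for y in s2)

s1 = [0, 1, 3]
s2 = [5, 9]
d1 = diameter(s1, absolute)
gap = set_distance(s1, s2, absolute)
```

For infinite subsets the bounds need not be attained, and this shortcut no longer applies.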

An  abstract  space  with  a  very  simple  and  very  basic  structure  is  a 
so-called  topological  space. 

Definition 3. A topological space* is a set T of elements, called points,
with a collection of subsets of these points, called open sets, satisfying the
following three postulates.

T1: T and the null set $\emptyset$ are open sets.

T2:  The  union  of  any  number  of  open  sets  is  an  open  set. 

T3:  The  intersection  of  any  two  open  sets  is  an  open  set. 

When, for a given point set T, we devise a system of subsets of T satisfying
the above postulates, we say that the set T has been topologized, or has
received a topology. A given set T may be topologized in more than one way.
In fact, it is an easy matter to show that if we select for the collection of
open sets of T all of the subsets of T, then T is topologized; T is also topologized
if we choose for the open sets of T only the two sets T and $\emptyset$. The
first of these two topologies imposed on T is called the discrete topology of T;
the second one is called the trivial topology of T. The notion of limit point
can be introduced into a topological space by the definition: A point p of T
is called a limit point of a subset S of T if every open set containing p (whether
p belongs to S or not) contains at least one point of S different from p. The
study of the consequences of the postulates for a topological space is called
general topology, and the limit point concept plays an important role in this
study. We shall content ourselves with the statement of only one theorem
about topological spaces, a theorem which links topological and Hausdorff
spaces.

Theorem 6. A topological space for which there exist, for any two distinct
points x and y, disjoint open sets $S_x$ and $S_y$ containing x and y, respectively, is a
Hausdorff space with respect to the open sets as neighborhoods.

We leave it to the reader to show that the postulates for a Hausdorff
space are satisfied if one chooses for neighborhoods the open sets of T.
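
On a finite point set the postulates T1 to T3, and the separation condition of Theorem 6, can be checked exhaustively. The Python sketch below is an illustration only; the function names are our own, and for a finite collection it suffices to test unions and intersections of pairs of open sets. It confirms that the discrete and trivial topologies are indeed topologies and that only the former separates points.

```python
from itertools import chain, combinations

def powerset(points):
    """All subsets of a finite set, as frozensets."""
    pts = list(points)
    subsets = chain.from_iterable(combinations(pts, k) for k in range(len(pts) + 1))
    return [frozenset(s) for s in subsets]

def is_topology(points, open_sets):
    """Check T1, T2, T3 for a finite collection of subsets."""
    opens = {frozenset(s) for s in open_sets}
    if frozenset(points) not in opens or frozenset() not in opens:   # T1
        return False
    for a, b in combinations(opens, 2):
        if a | b not in opens or a & b not in opens:                 # T2, T3
            return False
    return True

def separates_points(points, open_sets):
    """Hypothesis of Theorem 6: distinct points lie in disjoint open sets."""
    opens = [frozenset(s) for s in open_sets]
    return all(
        any(x in a and y in b and not (a & b) for a in opens for b in opens)
        for x, y in combinations(sorted(points), 2)
    )

T = {1, 2, 3}
discrete = powerset(T)
trivial = [T, set()]
not_a_topology = [set(), {1}, {2}, T]   # the union {1, 2} is missing
```

The last collection fails T2, which shows that the postulates are not satisfied by an arbitrary family of subsets.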

Having  given  some  consideration  to  the  matter  of  individual  sets 
possessing  a  superimposed  structure,  we  shall  now  very  briefly  look  at  pairs 
of  related  sets.  A  considerable  amount  of  mathematical  theory  is  concerned 
with  two  related  sets.  This  leads  to  the  concept  of  function  and,  consequently, 
to  the  elaborate  branch  of  mathematics  known  as  function  theory. 

The  concept  of  function,  like  the  notions  of  space  and  geometry,  has 
undergone  a  marked  evolution,  and  every  student  of  mathematics  encounters 
various refinements of this evolution as his studies progress from the elementary
courses of high school into the more advanced and sophisticated
courses of the graduate college level.

*The definition of a topological space is not uniform in the literature.

The  history  of  the  notion  of  function  furnishes  another  interesting 
example  of  the  tendency  of  mathematicians  to  generalize  and  extend  their 
concepts.  The  word  “function,”  in  its  Latin  equivalent,  seems  to  have  been 
introduced  by  Leibniz  (1646-1716)  in  1694,  at  first  as  a  term  to  denote 
any  quantity  connected  with  a  curve,  such  as  the  coordinates  of  a  point  on 
the  curve,  the  slope  of  the  curve,  and  so  on.  Johann  Bernoulli  (1667-1748), 
by  1718,  had  come  to  regard  a  function  as  any  expression  made  up  of  a 
variable  and  some  constants,  and  Euler  (1707-1783),  somewhat  later, 
regarded  a  function  as  any  equation  or  formula  involving  variables  and 
constants.  This  latter  idea  is  the  notion  of  a  function  formed  by  most  students 
of elementary mathematics courses. The familiar notation f(x) was not used
at  first,  but  entered  about  1734  with  A.  C.  Clairaut  (1713-1765)  and  Euler. 
The  Euler  concept  of  function  remained  unchanged  until  Fourier  (1768- 
1830)  was  led,  in  his  investigations  of  heat  flow,  to  consider  so-called 
trigonometric  series.  These  series  involve  a  more  general  type  of  relationship 
between  variables  than  had  previously  been  studied,  and,  in  an  attempt  to 
furnish  a  definition  of  function  broad  enough  to  encompass  such  relation¬ 
ships,  Lejeune  Dirichlet  (1805-1859)  arrived  at  the  following  formulation: 
A  variable  is  a  symbol  which  represents  any  one  of  a  set  of  numbers;  if 
two variables x and y are so related that whenever a value is assigned to x
there is automatically assigned, by some rule or correspondence, a value to
y, then we say y is a (single-valued) function of x. The variable x, to which
values are assigned at will, is called the independent variable, and the variable
y, whose values depend upon those of x, is called the dependent variable. The
permissible  values  that  x  may  assume  constitute  the  domain  of  definition  of 
the  function,  and  the  values  taken  on  by  y  constitute  the  range  of  values  of 
the  function. 

The  student  of  mathematics  usually  meets  the  Dirichlet  definition  of 
function  in  his  introductory  course  in  calculus.  The  definition  is  a  very  broad 
one  and  does  not  imply  anything  regarding  the  possibility  of  expressing  the 
relationship between x and y by some kind of analytic expression; it stresses
the  basic  idea  of  a  relationship  between  two  sets  of  numbers. 

Set  theory  has  naturally  extended  the  concept  of  function  to  embrace 
relationships  between  any  two  sets  of  elements,  be  the  elements  numbers  or 
anything else. Thus, in set theory, a function f is defined to be any set of
ordered pairs of elements such that if $(a_1,b_1) \in f$, $(a_2,b_2) \in f$, and $a_1 = a_2$,
then $b_1 = b_2$. The set A of all first elements of the ordered pairs is called the
domain (of definition) of the function, and the set B of all second elements
of the ordered pairs is called the range (of values) of the function. A functional
relationship is thus nothing but a special kind of subset of the Cartesian
product set $A \times B$. A one-to-one correspondence is, in its turn, a special
kind of function, namely, a function f such that if $(a_1,b_1) \in f$, $(a_2,b_2) \in f$, and
$b_1 = b_2$, then $a_1 = a_2$. If, for a functional relationship f, $(a,b) \in f$, we write
$b = f(a)$, which we read, “b equals f at a.”
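
The set-theoretic definition translates almost word for word into a test on finite sets of ordered pairs. The following Python sketch is an illustration; the function names and the sample relations are our own.

```python
def is_function(pairs):
    """True if no first element is paired with two different second elements."""
    seen = {}
    for a, b in pairs:
        if a in seen and seen[a] != b:
            return False
        seen[a] = b
    return True

def is_one_to_one(pairs):
    """True if the pairs form a function whose reversed pairs also form one."""
    return is_function(pairs) and is_function({(b, a) for a, b in pairs})

def domain(pairs):
    """The set of all first elements."""
    return {a for a, _ in pairs}

def range_of(pairs):
    """The set of all second elements."""
    return {b for _, b in pairs}

square = {(-2, 4), (-1, 1), (0, 0), (1, 1), (2, 4)}   # b = a squared
double = {(0, 0), (1, 2), (2, 4)}                     # b = 2a
circle = {(1, 1), (1, -1)}                            # not a function
```

Here `square` is a function but not a one-to-one correspondence, `double` is both, and `circle` is neither, since the first element 1 is paired with two different second elements.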

The  notion  of  function  pervades  much  of  mathematics,  and  since  the 
early  part  of  the  present  century  various  influential  mathematicians  have 
advocated  the  employment  of  this  concept  as  the  unifying  and  central 
principle  in  the  organization  of  elementary  mathematics  courses.  The  concept 
seems  to  form  a  natural  and  effective  guide  for  the  selection  and  development 
of  textual  material.  There  is  no  doubt  of  the  value  of  a  mathematics  student’s 
early  acquaintance  with  the  function  concept. 

The  property  of  continuity  may  immediately  be  associated  with  the 
concept  of  function  if  the  study  pertains  to  spaces  A  and  B  defined  by  a 
system of neighborhoods. In fact, a function f may be said to be continuous
at a point a of A if for any neighborhood $N_b \subset B$, where b is the point of B
associated with the given point a of A, there exists a neighborhood $N_a \subset A$
such that for any $z \in N_a$, $f(z) \in N_b$. Other definitions of continuity are
possible. 
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8.1.1  Show  that  if  A  C  B  and  B  C  A,  then  A  =  B. 

8.1.2 If a and b are real numbers and a < b, let

[a, b] represent the number interval a ≤ x ≤ b,
(a, b] represent the number interval a < x ≤ b,
[a, b) represent the number interval a ≤ x < b,
(a, b) represent the number interval a < x < b.

Find

(a) [1, 3] ∪ [3, 4].        (h) [1, 3] ∩ [2, 4].
(b) [1, 3] ∪ [2, 4].        (i) [1, 3] ∩ [3, 7].
(c) [1, 7] ∪ [2, 5].        (j) [1, 3] ∩ [5, 7].
(d) [1, 3) ∪ [3, 4].        (k) [1, 3) ∩ [2, 4].
(e) (1, 3) ∪ …              (l) [1, 3] ∩ (2, 4].
(f) (1, 3) ∪ …              (m) [1, 3) ∩ (2, 4).
(g) (1, 3) ∪ (3, 4).        (n) (1, 3) ∩ (3, 7).

8.1.3 Show that ∅ ∪ A = A and ∅ ∩ A = ∅.

8.1.4 (a) If B ⊂ A, show that B ∪ A = A and B ∩ A = B.

(b) Show that A ∪ A = A and A ∩ A = A.

8.1.5 Let A be the set of all points of a Cartesian plane such that x ≥ 0,
and let B be the set of all points of the plane such that x ≤ 0. Find A ∪ B
and A ∩ B.

8.1.6 Let I = [−50, 50]. Find the following sets (see Problem 8.1.2 for
meaning of notation):

(a) [−10, 10]'.

(b) (5, 15)'.

(c) ∅'.

(d) (−50, 50)'.

(e) [−5, 0] ∪ [5, 10]'.

(f) [−5, 0] ∩ [5, 10]'.

(g) [[−5, 0] ∪ [5, 10]]'.

(h) [(−5, 0) ∪ (5, 10)']'.

(i) [[−5, 0] ∩ [5, 10]]'.

(j) [(−5, 0) ∩ (5, 10)']'.


8.1.7 Show that

(a) A ∪ A' = I.

(b) A ∩ A' = ∅.

(c) (A')' = A.

8.1.8 Show that A ⊂ B if and only if A ∩ B' = ∅.

8.1.9 Verify, by Venn diagrams,

(a) the associative laws for ∪ and ∩,

(b) the distributive law of ∪ with respect to ∩,

(c) the distributive law of ∩ with respect to ∪.

8.1.10  Label  each  of  the  eight  regions  of  Figure  37  by  an  appropriate 
expression  involving  A,  B,  and  C  which  will  exactly  describe  that  region. 


8.1.11  By  shading  the  appropriate  regions  on  a  Venn  diagram,  determine 
which  of  the  following  relationships  are  true: 

(a) (A' ∪ B)' = A ∩ B'.

(b) A' ∪ B' = (A ∪ B)'.

(c) A ∪ (B ∩ C)' = (A ∪ B') ∩ C.

(d) A ∪ (A ∩ B) = A.

(e) A ∩ (A ∪ B) = A.

8.1.12 (a) Show that a set A consisting of five elements contains 2⁵ subsets
(including itself and the null set).

(b) Let A be a set with seven elements and B a set with five elements.
What can be said about the number of elements in the sets A ∩ B and
A ∪ B?

(c) Generalize parts (a) and (b).

8.1.13 (a) Let D = A ∪ B and E = B ∪ C. Show that D ∪ C = A ∪ E.

(b) Let D = A ∩ B and E = B ∩ C. Show that D ∩ C = A ∩ E.

8.2.1  Establish  the  following  theorems  of  Boolean  algebra: 

(a)  The  identity  elements  z  and  u  are  unique. 

(b) z' = u and u' = z.

(c)  (a')'  =  a. 

8.2.2  Establish  the  following  theorems  of  Boolean  algebra: 

(a) (a ∩ b) ∪ (b ∩ c) ∪ (c ∩ a) = (a ∪ b) ∩ (b ∪ c) ∩ (c ∪ a).

(b) If a ∩ b = a ∩ c and a ∪ b = a ∪ c, then b = c.

(c) Show that the above two theorems are self-dual.

8.2.3 Prove Poretsky’s law: For given x and t, x = z if and only if t =
(x ∩ t') ∪ (x' ∩ t).


8.2.4  Establish  the  following  theorems  of  Boolean  algebra: 

(a) a ⊂ (a ∪ b).

(b) (a ∩ b) ⊂ a.

(c) If x ⊂ a and x ⊂ b, then x ⊂ (a ∩ b). (This shows that a ∩ b is the
greatest lower bound of a and b relative to the partial ordering relation ⊂.)

(d) If a ⊂ b, then (a ∪ c) ⊂ (b ∪ c) and (a ∩ c) ⊂ (b ∩ c).

8.2.5  Establish  the  following  theorems  of  Boolean  algebra: 

(a) a ⊂ b if and only if a ∩ b' = z.

(b) a ⊂ b if and only if b' ⊂ a'.

(c) a ⊂ b if and only if a ∩ b = a.

(d) b ⊂ a' if and only if a ∩ b = z.

(e) a' ⊂ b if and only if a ∪ b = u.

8.2.6 Let B = {1, 2, 3, 5, 6, 10, 15, 30} and let a ∪ b = the least common
multiple of a and b, and a ∩ b = the greatest common divisor of a and b.
Show that B, together with ∪ and ∩, constitutes a Boolean algebra.
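
A proof of Problem 8.2.6 must argue from the postulates of Section 8.2, but a finite example of this kind can at least be checked exhaustively by machine. The sketch below is in Python and rests on assumptions of our own: that the postulates in question amount to closure, the commutative laws, the two distributive laws, the existence of identity elements z and u, and the existence of complements; that z = 1 and u = 30; and that the complement of a is 30/a. The names join, meet, and complement are illustrative only.

```python
from itertools import product
from math import gcd

B = [1, 2, 3, 5, 6, 10, 15, 30]
Z, U = 1, 30  # assumed identity elements for join and meet respectively


def join(a, b):
    """a ∪ b: the least common multiple of a and b."""
    return a * b // gcd(a, b)


def meet(a, b):
    """a ∩ b: the greatest common divisor of a and b."""
    return gcd(a, b)


def complement(a):
    """Assumed complement: the complementary divisor of 30."""
    return 30 // a


def check_boolean_algebra():
    """Exhaustively test the assumed postulates on every pair and triple of B."""
    pairs = list(product(B, repeat=2))
    triples = list(product(B, repeat=3))
    closed = all(join(a, b) in B and meet(a, b) in B for a, b in pairs)
    commutative = all(
        join(a, b) == join(b, a) and meet(a, b) == meet(b, a) for a, b in pairs
    )
    distributive = all(
        join(a, meet(b, c)) == meet(join(a, b), join(a, c))
        and meet(a, join(b, c)) == join(meet(a, b), meet(a, c))
        for a, b, c in triples
    )
    identities = all(join(a, Z) == a and meet(a, U) == a for a in B)
    complements = all(
        join(a, complement(a)) == U and meet(a, complement(a)) == Z for a in B
    )
    return closed and commutative and distributive and identities and complements


print(check_boolean_algebra())
```

Such a check is a verification of one model, not a substitute for the requested proof.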

8.2.7 Let B = {z, u}, with binary operations ∪ and ∩ defined by the
following tables:

 ∪ | z  u          ∩ | z  u
---+------        ---+------
 z | z  u          z | z  z
 u | u  u          u | z  u

Show that B, with ∪ and ∩, constitutes a Boolean algebra.


8.2.8 Let B = {a, b, c, d}, with binary operations ∪ and ∩ defined by the
following tables:

 ∪ | a  b  c  d
---+------------
 a | a  b  c  d
 b | b  b  b  b
 c | c  b  c  b
 d | d  b  b  d

 ∩ | a  b  c  d
---+------------
 a | a  a  a  a
 b | a  b  c  d
 c | a  c  c  a
 d | a  d  a  d

Show that B, with ∪ and ∩, constitutes a Boolean algebra.

8.2.9 (a) Define operations of addition and multiplication in a Boolean
algebra by

a + b = (a ∩ b') ∪ (a' ∩ b), ab = a ∩ b.

Show that B constitutes a commutative ring with respect to these operations
of addition and multiplication. (For a definition of a commutative ring, see
Section 5.2.)

(b) Show that in the ring of part (a), a² = a for each element a of B.
(A ring having this latter property is called a Boolean ring.)

(c) Show that a + a = z for each element a of B.

(d) Show that ab(a + b) = z for all elements a, b of B.
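
The identities of Problem 8.2.9 can be tried out in one concrete Boolean algebra before a general proof is attempted. The following Python sketch takes, as an assumption of ours, the algebra of all subsets of the set {1, 2, 3}, with z the null set and u the whole set; the names comp, add, and mul are illustrative. Addition becomes what is usually called the symmetric difference of two sets.

```python
from itertools import combinations, product

U = frozenset({1, 2, 3})
Z = frozenset()
# All eight subsets of U serve as the elements of the Boolean algebra.
ELEMENTS = [
    frozenset(c) for r in range(len(U) + 1) for c in combinations(sorted(U), r)
]


def comp(a):
    """a': the complement of a relative to U."""
    return U - a


def add(a, b):
    """a + b = (a ∩ b') ∪ (a' ∩ b)."""
    return (a & comp(b)) | (comp(a) & b)


def mul(a, b):
    """ab = a ∩ b."""
    return a & b


def is_commutative_ring():
    """Check the usual commutative-ring laws on every triple of elements."""
    return all(
        add(a, b) == add(b, a)
        and mul(a, b) == mul(b, a)
        and add(add(a, b), c) == add(a, add(b, c))
        and mul(mul(a, b), c) == mul(a, mul(b, c))
        and mul(a, add(b, c)) == add(mul(a, b), mul(a, c))
        and add(a, Z) == a
        and add(a, a) == Z  # each element is its own additive inverse
        for a, b, c in product(ELEMENTS, repeat=3)
    )


print(is_commutative_ring())
print(all(mul(a, a) == a for a in ELEMENTS))
```

The last line checks the property a² = a of part (b) for this one model only.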

8.2.10  Establish  the  independence  of  the  postulates  for  Boolean  algebra 
given  in  Section  8.2. 

8.2.11 Referring to Definition 1 of Section 8.2, write b ⊃ a (read “b contains
a”) if and only if a ⊂ b. Show that b ⊃ a if and only if b ∩ a = a,
thus showing that ⊂ and ⊃ are dual symbols.

8.3.1 Whether an end point A (or B) of a line segment AB is to be considered
as belonging or not belonging to the segment will be indicated by
using a bracket or a parenthesis, respectively, about the letter A (or B).
Using this notation, show that the segments [AB], (AB], [AB), (AB), considered
as sets of points, are equivalent to one another.

8.3.2  Show,  starting  with  Figure  38,  that  the  set  of  points  composing  any 
finite  segment  and  the  set  composing  any  infinite  segment  are  equivalent  to 
each  other. 


8.3.3  Show  that  equivalence  of  sets  is  reflexive,  symmetric,  and  transitive. 

8.3.4 (a) Let A = {1, 2, 3, 4} and B = {a, b, c}. Form the sets A × B and
B × A.

(b) Take A and B as in part (a) and let C = {x, y}. Exhibit the sets
A × (B ∪ C), A × (B × C), (A × B) × C.

8.3.5  (a)  Carry  out  the  details  of  the  proof  of  Theorem  1  of  Section  8.3. 
(b)  Prove  Theorem  4  of  Section  8.3. 

8.3.6 Show that the cancellation laws for addition and multiplication do
not hold for cardinal numbers of infinite sets.

8.4.1 Devise an infinite sequence of subsets of the set of natural numbers
such that each subset is a proper subset of its predecessor and yet has cardinal
number d.

8.4.2  Explain  how  Figure  39  shows,  with  the  aid  of  Dedekind’s  definition 
of  infinite  set,  that  the  set  of  points  on  a  line  segment  is  an  infinite  set. 


8.4.3  Prove  that  the  assumed  existence  of  at  least  one  rational  number 
between  any  two  rational  numbers  implies  the  existence  of  a  denumerable 
number  of  rational  numbers  between  any  two  rational  numbers. 

8.4.4  (a)  Show  that  every  rational  number  is  an  algebraic  number  and 
hence  that  every  real  transcendental  number  is  irrational. 

(b)  Is  every  irrational  number  a  transcendental  number? 

(c) Is the imaginary unit i algebraic or transcendental?

(d) Show that π/2 is transcendental.

(e) Show that π + 1 is transcendental.

(f) Generalize parts (d) and (e).

(g) Show that any complex number which is a zero of a polynomial
of the form

a₀xⁿ + a₁xⁿ⁻¹ + ··· + aₙ₋₁x + aₙ,

where a₀ ≠ 0 and all the aᵢ's are rational numbers, is an algebraic number.

8.4.5  (a)  Prove  that  the  union  of  a  finite  number  of  denumerable  sets  is  a 
denumerable  set. 

(b)  Prove  that  the  union  of  a  denumerable  number  of  denumerable 
sets  is  a  denumerable  set. 

8.4.6  (a)  Show  that  1  is  the  only  polynomial  of  height  1. 

(b)  Show  that  x  and  2  are  the  only  polynomials  of  height  2. 

(c) Show that x², 2x, x + 1, x − 1, and 3 are the only polynomials of
height 3, and that they yield the distinct algebraic numbers 0, 1, −1.

(d) Form all possible polynomials of height 4 and show that the only
new real algebraic numbers contributed are −2, −1/2, 1/2, 2.

(e) Show that polynomials of height 5 contribute 12 more real algebraic
numbers.

8.4.7  (a)  Show  that  the  set  of  all  irrational  numbers  is  nondenumerable. 
(b)  Show  that  the  set  of  all  transcendental  numbers  is  nondenumerable. 

8.4.8  (a)  Show  that  if  a  circle  has  a  center  with  at  least  one  irrational 
coordinate,  then  there  are  at  most  two  points  on  the  circle  with  rational 
coordinates. 

(b)  Show  that  if  a  circle  has  a  center  with  at  least  one  transcendental  co¬ 
ordinate,  then  there  are  at  most  two  points  on  the  circle  with  algebraic 
coordinates. 

(c)  Is  it  possible  for  a  straight  line  or  a  circle  in  the  Cartesian  plane 
to  contain  only  points  having  rational  coordinates?  algebraic  coordinates? 

8.4.9  (a)  Complete  the  details  of  the  following  proof  that  the  set  of  all 
points  on  a  line  segment  AB  is  nondenumerable: 

Take the length of AB to be 1 unit, and assume that the points on AB
constitute a denumerable set. The points on AB can then be arranged in a
sequence {P₁, P₂, P₃, ···}. Enclose point P₁ in an interval of length 1/10,
point P₂ in an interval of length (1/10)², point P₃ in an interval of length
(1/10)³, and so on. It follows that the unit interval AB is entirely covered
by an infinite sequence of possibly overlapping subintervals of lengths 1/10,
(1/10)², (1/10)³, ···. But the sum of the lengths of these subintervals is

1/10 + (1/10)² + (1/10)³ + ··· = 1/9 < 1.


(b) By choosing the subintervals in part (a) to be of lengths ε/10, ε/10²,
ε/10³, ···, where ε is an arbitrarily small positive number, show that a
denumerable  set  of  points  can  be  covered  by  a  set  of  intervals  the  sum  of 
whose  lengths  can  be  made  as  small  as  we  please.  (Using  the  terminology 
of  measure  theory,  we  say  that  a  denumerable  set  of  points  has  zero  measure.) 

8.4.10  Consider  the  following  two  arguments: 

I.  Theorem.  Of  all  triangles  inscribed  in  a  circle ,  the  equilateral  is  the  greatest. 

(1) If ABC is a nonequilateral triangle inscribed in a circle, so that
AB ≠ AC, say, construct triangle XBC, where X is the intersection of the
perpendicular bisector of BC with arc BAC. (2) Then triangle XBC >
triangle ABC. (3) Hence, if we have a nonequilateral triangle inscribed in a
circle, we can always construct a greater inscribed triangle. (4) Therefore,
of all triangles inscribed in a circle, the equilateral is the greatest.

II.  Theorem.  Of  all  natural  numbers,  1  is  the  greatest. 

(1) If m is a natural number other than 1, construct the natural number
m². (2) Then m² > m. (3) Hence, if we have a natural number other than 1,
we  can  always  construct  a  greater  natural  number.  (4)  Therefore,  of  all 
natural  numbers,  1  is  the  greatest. 

Now the conclusion in argument I is true, and that in argument II is
false. But the two arguments are formally identical. What, then, is wrong?

8.4.11 Let E₁ denote the set of all points on the segment (0, 1] and E₂
the set of all points in the unit square 0 < x, y ≤ 1. A point Z of E₁ may
be designated by an unending decimal z = 0.z₁z₂z₃ ··· lying between 0 and
1, and a point P of E₂ may be designated by an ordered pair of unending
decimals

(x = 0.x₁x₂x₃ ···, y = 0.y₁y₂y₃ ···),

each decimal lying between 0 and 1. Suppose we let each zᵢ, xᵢ, yᵢ in these
representations denote either a nonzero digit or a nonzero digit preceded
by a possible block of zeros. For example, if z = 0.73028007 ···, then
z₁ = 7, z₂ = 3, z₃ = 02, z₄ = 8, z₅ = 007, ···. Show that a one-to-one
correspondence may be set up between the points of E₁ and those of E₂
by associating with the point 0.z₁z₂z₃ ··· of E₁ the point

(0.z₁z₃z₅ ···, 0.z₂z₄z₆ ···)

of E₂, and with the point

(0.x₁x₂x₃ ···, 0.y₁y₂y₃ ···)

of E₂ the point 0.x₁y₁x₂y₂x₃y₃ ··· of E₁. Thus show that the set of all points
in a unit square has the cardinal number c. (This shows that the dimension
of a manifold cannot be distinguished by the cardinal number of the
manifold.)
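
The block construction of Problem 8.4.11 may be easier to follow when carried out on a finite initial run of digits. The Python sketch below is an illustration only: it works with finite digit strings (the digits after the decimal point), whereas the problem concerns unending decimals, and the function names blocks, segment_to_square, and square_to_segment are our own. Trailing zeros not yet followed by a nonzero digit are simply ignored.

```python
import re


def blocks(digits):
    """Split a digit string into blocks: a nonzero digit preceded by any zeros."""
    return re.findall(r"0*[1-9]", digits)


def segment_to_square(digits):
    """Send 0.z1z2z3... to the pair (0.z1z3z5..., 0.z2z4z6...)."""
    b = blocks(digits)
    return "".join(b[0::2]), "".join(b[1::2])


def square_to_segment(x_digits, y_digits):
    """Send (0.x1x2x3..., 0.y1y2y3...) to 0.x1y1x2y2x3y3...

    Only as many blocks as both coordinates supply are interleaved.
    """
    xb, yb = blocks(x_digits), blocks(y_digits)
    return "".join(x + y for x, y in zip(xb, yb))


print(blocks("73028007"))
print(segment_to_square("730280071"))
print(square_to_segment("702007", "381"))
```

Working with blocks rather than single digits is what prevents a coordinate from ending in an unbroken run of zeros, which is the difficulty that spoils the naive digit-by-digit interleaving.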

8.4.12  (a)  Show  that  any  infinite  set  of  mutually  external  closed  intervals 
on  a  straight  line  is  denumerable. 

(b)  Show  that  any  infinite  set  of  mutually  external  circles  lying  in  a 
plane  is  denumerable. 

8.4.13  Show  that  the  set  of  all  finite  sequences  of  non-negative  integers  is 
denumerable. 

8.4.14 Let S denote the set of all single-valued functions of one variable x
which assume positive integral values whenever x is a positive integer. Show
that  S  is  nondenumerable. 

8.4.15 The postulate set P1, P2, P3, P4 of Section 6.2 is categorical (see
Problem 6.4.4). Show that if P4 is replaced by P'4: K consists of exactly d
distinct elements, then the set is no longer categorical.

8.5.1  (a)  Show  that  the  set  of  all  points  in  the  plane  can  be  made  into  a 
Hausdorff  space  by  selecting  for  neighborhoods  of  a  point  P  the  interiors  of 
all  circles  having  P  as  center. 

(b)  Show  that  the  set  of  all  points  in  the  plane  can  be  made  into  a 
Hausdorff  space  by  selecting  for  neighborhoods  of  a  point  P  the  interiors 
of  all  squares  having  centers  at  P  and  having  sides  parallel  to  two  given 
perpendicular  lines  of  the  plane. 

(c)  Show  that  any  set  of  points  can  be  made  into  a  Hausdorff  space  if 
we  select  for  neighborhoods  the  points  themselves. 

8.5.2 Let H be the Hausdorff space of Problem 8.5.1(a). Find the set of
limit points of each of the following sets of points in H (here described, for
convenience, in terms of a Cartesian coordinate system superimposed on the
plane):

(a) All points (x, y) for which x > 0.

(b) All points (x, y) such that x² + y² < 9.

(c) The set of points (1, 1), (1/2, 1/2), (1/3, 1/3), ···.

(d) All points (x, y) such that y = sin (1/x) and 0 < x ≤ 1.

(e)  The  set  of  all  points  (m/n,  1  /n),  where  m  is  an  integer  and  n  is  a 
positive  integer. 

8.5.3 Define a circle with center C and radius r in a metric space M as the
set of all points X of M such that ρ(C, X) = r. Describe the appearance of a
circle  in  each  of  the  Examples  1,  2,  3  of  metric  spaces  given  in  Section  8.5. 

8.5.4  (a)  Prove  Theorem  4  of  Section  8.5. 

(b)  Complete  the  proof  of  Theorem  5  of  Section  8.5. 

(c)  Prove  Theorem  6  of  Section  8.5. 

8.5.5 Show that if ρ(x, y) is a metric for a set M of points, then we may
also use as a metric for M

(a) kρ(x, y), where k is a positive real number,

(b) √ρ(x, y),

(c) ρ(x, y)/[1 + ρ(x, y)].

8.5.6 (a) Show that any set M of elements can be made into a metric space
by setting ρ(x, y) = 1 if x ≠ y, and ρ(x, y) = 0 if x = y.

(b) Show that the set M of all ordered pairs of real numbers, in which
the distance function is defined by ρ(p₁, p₂) = max (|x₂ − x₁|, |y₂ − y₁|),
where p₁ = (x₁, y₁), p₂ = (x₂, y₂), is a metric space.

(c)  Show  that  the  set  M  of  all  points  on  the  circumference  of  a  circle, 
in  which  the  distance  between  two  points  is  defined  as  the  length  of  the 
shorter  arc  of  the  circle  connecting  the  two  points,  is  a  metric  space. 

(d)  Generalize  part  (c)  to  the  set  M  of  all  points  on  the  surface  of  a 
sphere. 
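
Before the distance functions of Problems 8.5.5(c) and 8.5.6(a), (b) are treated in general, they can be spot-checked on a handful of sample points. The Python sketch below assumes that the postulates for a metric space in Section 8.5 are the customary ones (the distance is non-negative, vanishes exactly for coincident points, is symmetric, and satisfies the triangle inequality); the function names are our own. Passing such a check on finitely many points proves nothing, but a failure does exhibit a counterexample.

```python
from itertools import product


def discrete(p, q):
    """Problem 8.5.6(a): distance 0 for equal points, 1 otherwise."""
    return 0 if p == q else 1


def max_metric(p, q):
    """Problem 8.5.6(b): max(|x2 - x1|, |y2 - y1|) for ordered pairs."""
    return max(abs(q[0] - p[0]), abs(q[1] - p[1]))


def bounded(rho):
    """Problem 8.5.5(c): turn rho into rho / (1 + rho)."""
    return lambda p, q: rho(p, q) / (1 + rho(p, q))


def looks_like_metric(rho, points, tol=1e-12):
    """Test the assumed metric postulates on all triples of sample points."""
    for p, q, r in product(points, repeat=3):
        if rho(p, q) < 0:
            return False
        if (rho(p, q) == 0) != (p == q):
            return False
        if abs(rho(p, q) - rho(q, p)) > tol:
            return False
        if rho(p, r) > rho(p, q) + rho(q, r) + tol:
            return False
    return True


sample = [(0, 0), (1, 2), (-3, 4), (2.5, -1)]
print(looks_like_metric(discrete, sample))
print(looks_like_metric(max_metric, sample))
print(looks_like_metric(bounded(max_metric), sample))
```

By way of contrast, the square of the ordinary distance fails the triangle inequality on the collinear points (0, 0), (1, 0), (2, 0), and the same routine detects this.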

8.5.7 (a) Let T be the three distinct points A, B, C, and let the open sets
of T be ∅, {A}, {A, B}, {A, C}, T. Show that T is a topological space.

(b) Show that T in part (a) is not a Hausdorff space if the open sets
of T are chosen as neighborhoods.

(c)  The  example  of  part  (a)  shows  that  the  postulates  for  a  topological 
space  are  consistent.  Show,  by  appropriate  models,  that  these  postulates  are 
independent. 

8.5.8  Establish  the  consistency  and  independence  of  the  postulates  for  a 
Hausdorff  space. 

8.5.9  Establish  the  consistency  and  independence  of  the  postulates  for  a 
metric  space. 

8.5.10 (for students who have studied calculus) Show that the definition of
continuity of a function appearing at the end of Section 8.5 is a generalization
of the concept of continuity of a real function of a real variable as defined
in the usual course in calculus.

8.5.11 Let A and B be two sets of elements. Show that we may define a
subset R of A × B to be a dyadic relation between the elements of A and
the elements of B.

8.5.12 In elementary mathematics, functions of a limited sort only are
considered. Thus a typical definition of “function” in elementary mathematics
might read as follows: A function is a rule which assigns one and only
one number to each number in some collection of numbers.

(a) Show that this definition is a special case of the more general
definition of a function as given in Section 8.5.

(b) What constitutes the domain of definition in the above elementary
concept of a function?

(c) What constitutes the range of values of a function as defined in
elementary mathematics?

(d) In what sense can the equation F = (9/5)C + 32 be regarded as
a function in elementary mathematics?

(e) In what sense can a bar graph showing the number of students
failing a certain course in each of certain years be regarded as a function in
elementary mathematics?

(f) In what sense can a table of trigonometric tangents be regarded
as a function in elementary mathematics?

(g) Compare a function to a machine equipped with a hopper at the
top and a spout at the bottom. Raw materials are fed into the machine
through the hopper. The machine then converts these raw materials into a
product which is expelled through the spout of the machine.


CHAPTER NINE

LOGIC  AND  PHILOSOPHY 


9.1  Symbolic  Logic 

We  noted,  in  Chapter  6,  that  a  mathematical  theory  results  from  the 
interplay  of  two  factors,  a  set  of  postulates  and  a  logic.  The  set  of  postulates 
constitutes  the  basis  from  which  the  theory  starts,  and  the  logic  constitutes 
the  rules  by  which  such  a  basis  may  be  expanded  into  a  body  of  theorems. 
The  mathematical  theory,  or  system,  comprises  the  totality  of  statements 
composed  of  both  the  postulates  and  the  theorems. 

Considerable  attention  has  been  paid  in  the  preceding  chapters  to  the 
first  of  the  two  factors,  namely,  the  underlying  postulate  set,  but  little 
attention  has  been  paid  to  the  second  factor,  namely,  the  logic  employed 
in  deducing  the  theorems  of  the  discourse  from  the  postulational  base.  It  is 
the  purpose  of  the  present  chapter  partially  to  remedy  this  deficiency. 

It  would  be  a  well  nigh  hopeless  task  to  discuss  modern  considerations 
of  logic  by  the  use  of  only  ordinary  language.  A  symbolic  language  has 
become  necessary  in  order  to  achieve  the  required  exact  scientific  treatment 
of  the  subject.  Because  of  the  presence  of  such  symbolism,  the  resulting 
treatment is known as symbolic, or mathematical, logic. In symbolic logic the
various relations among propositions, classes, etc., are represented by formulas
whose meanings are free from the ambiguities so common to ordinary
language.  It  becomes  possible  to  develop  the  subject  from  a  set  of  initial 
formulas  in  accordance  with  certain  clearly  prescribed  rules  of  formal 
transformation,  much  like  the  development  of  a  piece  of  common  algebra. 
Also,  and  again  as  in  the  development  of  a  piece  of  common  algebra,  the 
advantages  of  the  symbolic  language  over  ordinary  language  insofar  as 
compactness  and  ease  of  comprehension  are  concerned  are  very  great. 

Leibniz is regarded as the first to consider seriously the desirability of a
symbolic logic. One of his earliest works was an essay, De arte combinatoria,
published in 1666, in which he indicated his belief in the possibility of a
universal  scientific  language,  expressed  in  an  economical  and  workable 
symbolism  for  guidance  in  the  reasoning  process.  Returning  to  these  ideas 
between  the  years  1679  and  1690,  Leibniz  made  considerable  headway 
toward  the  creation  of  a  symbolic  logic,  and  he  formulated  a  number  of  the 
concepts  that  are  so  important  in  modern  studies. 

An  important  renewal  of  interest  in  symbolic  logic  took  place  in  1847 
when George Boole (1815-1864) published his little pamphlet entitled
The Mathematical Analysis of Logic, Being an Essay towards a Calculus of Deductive
Reasoning. Another paper followed in 1848, and finally, in 1854, Boole gave
a notable exposition of his ideas in the work, An Investigation of the Laws of
Thought, on Which Are Founded the Mathematical Theories of Logic and Probabilities.

Augustus  De  Morgan  (1806-1871)  was  a  contemporary  of  Boole,  and 
his treatise on Formal Logic; or, the Calculus of Inference, Necessary and Probable,
published  in  1847,  in  some  ways  went  considerably  beyond  Boole.  Later 
De  Morgan  also  made  extended  studies  of  the  hitherto  neglected  logic  of 
relations. 

In  the  United  States,  outstanding  work  in  the  field  was  contributed  by 
Charles  Sanders  Peirce  (1839-1914),  son  of  the  distinguished  Harvard 
mathematician,  Benjamin  Peirce.  Peirce  rediscovered  some  of  the  principles 
enunciated  by  his  predecessors.  It  is  unfortunate  that  his  work  appeared 
somewhat  out  of  the  stream  of  normal  development;  only  in  comparatively 
recent  times  has  the  merit  of  much  of  Peirce’s  thought  been  properly 
appreciated. 

The  notions  of  Boole  were  given  a  remarkable  completeness  in  the 
massive treatise by Ernst Schröder (1841-1902) entitled Vorlesungen über
die Algebra der Logik, published during the period 1890-1895. In fact, modern
logicians are inclined to characterize symbolic logic in the Boolean tradition
by the term Boole-Schröder algebra. Considerable work is still being done in
Boolean  algebra,  and  many  papers  upon  the  subject  are  to  be  found  in 
present-day  research  journals. 

A still more modern approach to symbolic logic originated with the
work of the German logician Gottlob Frege (1848-1925) during the period
1879-1903, and with the studies of Peano referred to in Section 4.1. Peano’s
work was motivated by a desire to express all mathematics in terms of a
logical calculus, and Frege’s work stemmed from the need of a sounder
foundation for mathematics. Frege’s Begriffsschrift appeared in 1879, and
his historically important Grundgesetze der Arithmetik in 1893-1903; the
Formulaire de mathématiques of Peano and his co-workers began its appearance
in 1894. The work started by Frege and Peano led directly to the very influential
and monumental Principia mathematica (1910-1913) of Whitehead and
Russell. The basic idea of this work is the identification of much of mathematics
with logic by the deduction of the natural number system, and hence
of the great bulk of existing mathematics, from a set of premises or postulates
for logic itself. In 1934-1939 appeared the comprehensive Grundlagen der
Mathematik of David Hilbert and Paul Bernays. This work, based upon a
series of papers and university lectures given by Hilbert, attempts to build
up mathematics by the use of symbolic logic in a new way which renders
possible the establishment of the consistency of mathematics.

At  the  present  time  elaborate  studies  in  the  field  of  symbolic  logic 
are  being  pursued  by  many  mathematicians,  chiefly  as  a  result  of  the  impetus 
given  to  the  work  by  the  publication  of  the  Principia  mathematica.  A  periodical, 
known as the Journal of Symbolic Logic, was established in 1935 to publicize
the  writings  of  this  group. 

In  this  section  and  the  next  we  shall  endeavor  to  give  a  slight  idea  of 
the  nature  of  symbolic  logic,  restricting  ourselves  to  the  so-called  propositional 
calculus  as  developed  by  Whitehead  and  Russell.  In  the  present  section  we 
shall  introduce  needed  concepts  and  symbolism,  and  in  the  next  section  we 
shall  outline  a  postulational  approach  to  the  subject.  Of  course,  in  such  a 
limited  space,  our  treatment  must  be  brief  and  incomplete. 

A proposition is to be understood as any statement concerning which it
is meaningful to say that its content is true or false,* whether or not we know
which of these terms actually applies. Examples of propositions are: “Spring
is a season,” “8 is a prime number,” “The billionth digit in the decimal
expansion of π is 7.” The first proposition is true, the second false, and
though the truth or falsity of the third is not known, we shall consider it
meaningful to say that the third statement is true or false. Propositions will
be denoted in the symbolic logic by the lower-case letters m, n, p, q, r, ···,
and we shall agree that when we assert a proposition p, without qualification
as to its truth or falsity, we mean to imply that p is true.

Propositions can be combined in various ways to form new propositions.
For example, from the two propositions “Spring is a season,” “8 is a prime
number,” we can construct the new propositions: “Spring is a season and
8 is a prime number,” “Spring is a season or 8 is a prime number,” “If
spring is a season, then 8 is a prime number,” and “Spring is a season if
and only if 8 is a prime number.” Finally, from the single proposition, “Spring
is a season,” we can construct the new proposition, “Spring is not a season”;
that is, we can construct the proposition which denies the first proposition.

*We shall not go into the epistemological question of the meaning of “true” and
of “false.”

The  above  combinations  of  propositions  are  rendered  verbally  by  the 
words and, or, if-then, if and only if, not. In symbolic logic these fundamental
combinations  of  propositions  are  rendered  by  suitable  symbolism.  For  this 
purpose  we  introduce  the  following  five  symbols: 

1. p ∧ q (read “p and q”) represents the proposition which is true when
and only when both p and q are true. A proposition of this form is
called a conjunction.

2. p ∨ q (read “p or q”) represents the proposition which is true when
and only when at least one of the two propositions p, q is true. A proposition
of this form is called a disjunction.

3. p → q (read “if p, then q”) represents the proposition which is false
when and only when p is true and q is false. A proposition of this form
is called an implication.

4. p ↔ q (read “p if and only if q”) represents the proposition which is
true when and only when p and q are both true or both false. The
assertion, p ↔ q, means, therefore, that p and q have the same truth
value. A proposition of this form is called an equivalence.

5. p' (read “not p”) represents the denial, or contradiction, of p. That is,
p' represents that proposition which is true when p is false and which
is false when p is true. A proposition of this form is called a negation.*

We observe that ∧, ∨, →, and ↔ are symbols denoting binary operations
performed on propositions, and that ' is a symbol denoting a unary
operation performed on propositions.
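
The five definitions above are completely determined by the truth values of p and q, so they can be tabulated mechanically. The following Python sketch is an illustration only; the function names conjunction, disjunction, implication, equivalence, negation, and truth_table are our own, and the truth values are represented by Python's True and False.

```python
from itertools import product


def conjunction(p, q):
    """p ∧ q: true when and only when both p and q are true."""
    return p and q


def disjunction(p, q):
    """p ∨ q: true when and only when at least one of p, q is true."""
    return p or q


def implication(p, q):
    """p → q: false when and only when p is true and q is false."""
    return (not p) or q


def equivalence(p, q):
    """p ↔ q: true when and only when p and q have the same truth value."""
    return p == q


def negation(p):
    """p': true when p is false, and false when p is true."""
    return not p


def truth_table(connective):
    """Tabulate a binary connective over all four assignments to (p, q)."""
    return {(p, q): connective(p, q) for p, q in product([True, False], repeat=2)}


for name, op in [("and", conjunction), ("or", disjunction),
                 ("if-then", implication), ("if and only if", equivalence)]:
    print(name, truth_table(op))
```

Each table has exactly four rows, one for each assignment of truth values to the pair (p, q).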

It is to be noted that the logical meanings of and, or, if-then, and if and
only if differ somewhat from the customary uses of these connectives in
ordinary language.

For  example,  in  ordinary  language,  and  is  commonly  used  to  conjoin 
two  propositions  that  are  relevant  to  each  other  in  what  might  be  called 
connected  discourse,  as  in  a  coherent  description  of  a  sequence  of  events, 
such  as 

He  took  the  train  and  arrived  in  Boston. 

In  logic,  however,  and  is  used  to  connect  any  two  propositions  whatever, 
without  regard  to  their  possible  relevance  or  bearing  on  one  another,  as  in 
the  example, 

Spring is a season and 8 is a prime number.

*The symbols for conjunction, disjunction, implication, equivalence, and negation
are by no means standardized, and vary considerably among authors. Thus Whitehead
and Russell write p · q, p ∨ q, p ⊃ q, p ≡ q, ~p, and Hilbert writes p & q,
p ∨ q, p → q, p ~ q, p̄.
Similarly, in ordinary language, the connective or appears in two senses,
the exclusive sense of the Latin aut (“p or q but not both”) and the inclusive
sense of the Latin vel (“p or q or both”). It is in the latter sense, the sense of
the familiar juristic barbarism “and/or,” that or is used in logic,* and, again,
no regard is paid to the possible relevancy or bearing of one proposition
upon the other.

Particularly interesting is the logical use of if-then. In ordinary language,
the compound statement, “If p, then q,” is taken to indicate a relation of
premise and conclusion or of cause and effect, as in the statement,

If  it  is  raining,  then  we  shall  not  go. 

In logic, on the other hand, the implication p → q may connect any two
propositions p and q whatever, and, by Definition 3 above, the implication
is to be considered as false when and only when p is true and q is false.
We thus have the interesting situation that the implication p → q is true
whenever p is false and also whenever q is true. Thus all three of the following
implications are to be considered true:

If  7  is  a  prime  number,  then  2  times  2  is  4, 

If  8  is  a  prime  number,  then  2  times  2  is  4, 

If  8  is  a  prime  number,  then  2  times  2  is  5. 

Though  somewhat  startling,  these  results  need  not  be  unexpected  if  we 
remember  that  we  are  concerned  here,  not  with  the  meanings  of  propositions, 
but  only  with  their  truth  values.  An  implication  which  is  based  on  a  relation 
of  premise  and  conclusion  or  of  cause  and  effect  depends  upon  the  structure 
of the component propositions rather than upon their truth or falsity.†

Finally, the assertion, “p if and only if q,” is not intended in logic to
imply  that  the  component  propositions  p  and  q  have  the  same  meaning  or 
significance,  but  merely  that  the  assertion  is  true  when  and  only  when 
p  and  q  are  both  true  or  both  false.  Thus  the  two  statements, 

7  is  a  prime  number  if  and  only  if  2  times  2  is  4, 

8  is  a  prime  number  if  and  only  if  2  times  2  is  5, 

are  both  to  be  considered  true. 

*The meaning of the logical symbol ∨ may be kept in mind by associating it with
the initial letter of the Latin word vel.

†The definition of implication in logic is a controversial matter, and other analyses
of the nature of implication have been proposed. Implication as defined above is
called material implication. C. I. Lewis has introduced a concept called strict implication,
which seems more nearly to correspond to the relation holding when a conclusion is
said to be deducible from premises, but as yet there is no definition of implication
dependent on propositional structure that has won general acceptance among
logicians. In any event, material implication is bound up in any idea of implication,
and, no matter what definition of implication be adopted, at least we have a material
implication.
Probably the neatest and most compact way of summarizing the logical
meanings  of  conjunction,  disjunction,  implication,  equivalence,  and  negation 
is  by  the  use  of  truth  tables.  In  these  tables,  the  symbol  T  is  employed  to 
denote  that  the  corresponding  proposition  is  true,  F  that  it  is  false. 

Let us illustrate by forming the truth table for the operation of
conjunction.

CONJUNCTION

p   q   p ∧ q
T   T     T
T   F     F
F   T     F
F   F     F

This table is merely an expansion of the definition of conjunction given
above. It says that p ∧ q is true if p and q are each true, and that p ∧ q is
false in every other case. To form such a table one must list all possible
combinations of T and F for the component propositions (p and q in this
case) involved. The truth value of the composite proposition for each
combination is then entered in the final column.

Truth  tables  for  disjunction,  implication,  equivalence,  and  negation  are 
as  follows: 


DISJUNCTION

p   q   p ∨ q
T   T     T
T   F     T
F   T     T
F   F     F

IMPLICATION

p   q   p → q
T   T     T
T   F     F
F   T     T
F   F     T

EQUIVALENCE

p   q   p ↔ q
T   T     T
T   F     F
F   T     F
F   F     T

NEGATION

p   p'
T   F
F   T
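The five tables can also be produced mechanically. The following Python sketch is an illustration only and is not part of the original text; the names CONNECTIVES, truth_table, negation_table, and show are chosen here for convenience. The combinations of T and F are listed in the same order as in the tables above.

```python
from itertools import product

# Each binary connective is written as a function of the truth values of p and q.
CONNECTIVES = {
    "p and q": lambda p, q: p and q,
    "p or q": lambda p, q: p or q,
    "if p then q": lambda p, q: (not p) or q,   # false only when p is T and q is F
    "p iff q": lambda p, q: p == q,
}


def truth_table(connective):
    """Rows (p, q, value) in the order TT, TF, FT, FF."""
    return [(p, q, connective(p, q)) for p, q in product([True, False], repeat=2)]


def negation_table():
    """Rows (p, not p) for the unary operation of negation."""
    return [(p, not p) for p in (True, False)]


def show(value):
    return "T" if value else "F"


for name, connective in CONNECTIVES.items():
    print(name)
    for p, q, value in truth_table(connective):
        print(" ", show(p), show(q), show(value))

print("not p")
for p, value in negation_table():
    print(" ", show(p), show(value))
```

Writing implication as (not p) or q is simply a restatement of the definition that p → q is false when and only when p is true and q is false.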

The composite propositions considered above may be modified or
combined into more elaborate composite propositions, such as

[p ∧ (p → q)] → q,

and the truth value of the new proposition for any combination of truth
values of the fundamental propositions p and q may be determined by means
of the truth tables for conjunction, disjunction, implication, equivalence,
and negation. Thus, for the proposition just cited, we have


p   q   p → q   p ∧ (p → q)   [p ∧ (p → q)] → q
T   T     T          T                T
T   F     F          F                T
F   T     T          F                T
F   F     T          F                T

The  first,  second,  and  last  columns  constitute  the  truth  table  for  the  new 
proposition;  the  third  and  fourth  columns  serve  merely  as  aids  for  obtaining 
the  last  column.  This  truth  table  happens  to  possess  the  curious  feature  of 
having nothing but T’s in the last column. This means that the proposition
concerned has the truth value T no matter what truth value p has and no
matter what truth value q has. Propositions of this special sort are called
tautologies, or laws of logic, and they play an essential role in the study of
logic. The reader can easily show that the following propositions are all
tautologies.


SOME LAWS OF LOGIC

Law                                  Name of Law
p ∨ p'                               law of excluded middle
(p ∧ p')'                            law of contradiction
[(p → q) ∧ (q → r)] → (p → r)        law of the syllogism
p ↔ (p')'                            law of double negation
(p → q) ↔ (q' → p')                  law of contraposition
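The truth-table test for a tautology is easy to mechanize. The sketch below is an illustration under stated assumptions, not the authors' procedure: formulas are written as Python functions of their component propositions, and the helper names implies and is_tautology are chosen here. It checks the composite proposition [p ∧ (p → q)] → q and the five laws listed above.

```python
from inspect import signature
from itertools import product


def implies(a, b):
    """Material implication: false only when a is true and b is false."""
    return (not a) or b


def is_tautology(formula):
    """True if the formula is true for every assignment of T and F to its letters."""
    n = len(signature(formula).parameters)
    return all(formula(*values) for values in product([True, False], repeat=n))


LAWS = {
    "[p and (p -> q)] -> q": lambda p, q: implies(p and implies(p, q), q),
    "excluded middle": lambda p: p or (not p),
    "contradiction": lambda p: not (p and (not p)),
    "syllogism": lambda p, q, r: implies(implies(p, q) and implies(q, r), implies(p, r)),
    "double negation": lambda p: p == (not (not p)),
    "contraposition": lambda p, q: implies(p, q) == implies(not q, not p),
}

for name, law in LAWS.items():
    print(name, is_tautology(law))

# A plain implication p -> q is a proposition but not a tautology.
print(is_tautology(lambda p, q: implies(p, q)))
```

A formula with n letters requires 2^n rows, so this brute-force check is practical only for formulas with few component propositions.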

If one should form the truth tables for the two propositions p → q and
q' → p', it would be found that the tables agree with each other in point
of truth and falsity. Two propositions m and n of this sort are said to be
logically equivalent, and it follows that the proposition m ↔ n is a tautology,
or law of logic. Thus the proposition (p → q) ↔ (q' → p') is a law of logic,
as has been indicated in the above list of some of the laws of logic. The
importance of knowing that two propositions are logically equivalent lies
in the fact that we can alter the structure of one proposition into that of the
logically equivalent proposition with full confidence that we are not thereby
affecting the truth value of the original proposition. Thus the proposition
q' → p' may, anywhere it appears, be replaced by the proposition p → q.
Similarly, (p')' may, anywhere it appears, be replaced by p.

We  leave  to  the  reader  the  establishment  of  the  logical  equivalence  of 
the  following  pairs  of  propositions: 
SOME PAIRS OF LOGICALLY EQUIVALENT PROPOSITIONS

p ∨ q      (p' ∧ q')'
p → q      (p ∧ q')'
p ↔ q      (p ∧ q')' ∧ (q ∧ p')'

p ∧ q      (p' ∨ q')'
p → q      p' ∨ q
p ↔ q      [(p' ∨ q)' ∨ (q' ∨ p)']'

p ∧ q      (p → q')'
p ∨ q      p' → q
p ↔ q      [(p → q) → (q → p)']'

These are interesting logical equivalences, for they show that conjunction,
disjunction, implication, equivalence, and negation are, to some extent,
interdefinable, and that some of the logical connectives are therefore
dispensable. Thus the first three logical equivalences exhibit, respectively, ways
of expressing disjunction, implication, and equivalence in terms of
conjunction and negation; the middle three logical equivalences exhibit, respectively,
how we may express conjunction, implication, and equivalence in terms of
disjunction and negation; the last three logical equivalences exhibit,
respectively, how we may render conjunction, disjunction, and equivalence in
terms of implication and negation.

It can be shown that negation is indispensable for expressing combinations
of propositions, and that not all the fundamental logical connectives
can be rendered by equivalence and negation.

It is worth noting that disjunction can be expressed by implication alone,
without the use of negation, for p ∨ q and (p → q) → q are logically
equivalent. It is not possible to do this for conjunction.
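The logical equivalences just described can be confirmed by comparing truth tables row by row. In the following sketch, which is an illustration rather than part of the original text, each connective is defined directly by its truth table from Section 9.1, so that none of the equivalences is built into the definitions; the names conj, disj, imp, iff, neg, and equivalent are chosen here.

```python
from itertools import product

T, F = True, False

# Defining truth tables, keyed by the pair (p, q).
AND = {(T, T): T, (T, F): F, (F, T): F, (F, F): F}
OR = {(T, T): T, (T, F): T, (F, T): T, (F, F): F}
IMP = {(T, T): T, (T, F): F, (F, T): T, (F, F): T}
IFF = {(T, T): T, (T, F): F, (F, T): F, (F, F): T}
NOT = {T: F, F: T}


def conj(a, b): return AND[a, b]
def disj(a, b): return OR[a, b]
def imp(a, b): return IMP[a, b]
def iff(a, b): return IFF[a, b]
def neg(a): return NOT[a]


def equivalent(m, n):
    """Two propositions in p and q are logically equivalent if their tables agree."""
    return all(m(p, q) == n(p, q) for p, q in product([T, F], repeat=2))


PAIRS = [
    # in terms of conjunction and negation
    (disj, lambda p, q: neg(conj(neg(p), neg(q)))),
    (imp, lambda p, q: neg(conj(p, neg(q)))),
    (iff, lambda p, q: conj(neg(conj(p, neg(q))), neg(conj(q, neg(p))))),
    # in terms of disjunction and negation
    (conj, lambda p, q: neg(disj(neg(p), neg(q)))),
    (imp, lambda p, q: disj(neg(p), q)),
    (iff, lambda p, q: neg(disj(neg(disj(neg(p), q)), neg(disj(neg(q), p))))),
    # in terms of implication and negation
    (conj, lambda p, q: neg(imp(p, neg(q)))),
    (disj, lambda p, q: imp(neg(p), q)),
    (iff, lambda p, q: neg(imp(imp(p, q), neg(imp(q, p))))),
    # disjunction by implication alone
    (disj, lambda p, q: imp(imp(p, q), q)),
]

print(all(equivalent(m, n) for m, n in PAIRS))
```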

H. M. Sheffer showed, in 1913, that all the fundamental logical
connectives can be rendered by means of a single logical symbol, which he called
stroke. Sheffer’s stroke combination is written as p/q, and means: “Not
p or not q.” The reader may easily show that p/p and p' are logically
equivalent and that (p/p)/(q/q) and p ∨ q are logically equivalent. Since disjunction
and negation can be expressed by Sheffer’s stroke symbol, it follows that
the other fundamental logical connectives can also be expressed by means of
this symbol.
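The two stroke equivalences mentioned above, and the consequence that the remaining connectives follow, can be checked directly. This Python sketch is offered only as an illustration; the stroke expressions for conjunction and implication are standard identities supplied here and are not stated in the text.

```python
from itertools import product


def stroke(p, q):
    """Sheffer's stroke p/q, meaning 'not p or not q'."""
    return (not p) or (not q)


def neg(p):
    return stroke(p, p)                          # p/p is equivalent to p'


def disj(p, q):
    return stroke(stroke(p, p), stroke(q, q))    # (p/p)/(q/q) is equivalent to p or q


def conj(p, q):
    return stroke(stroke(p, q), stroke(p, q))    # assumption: (p/q)/(p/q) for p and q


def imp(p, q):
    return stroke(p, stroke(q, q))               # assumption: p/(q/q) for p -> q


for p, q in product([True, False], repeat=2):
    assert neg(p) == (not p)
    assert disj(p, q) == (p or q)
    assert conj(p, q) == (p and q)
    assert imp(p, q) == ((not p) or q)

print("all stroke definitions agree with the truth tables")
```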

9.2  The  Calculus  of  Propositions 

In  the  employment  of  the  postulational  method  we  deduce  theorems 
from  postulates  or  from  earlier  theorems.  In  making  these  deductions  we 
engage  in  an  argumentation  from  premises  to  conclusion,  and  our  reasoning 
takes the implicative form of “if such and such, then so and so.” We are
not  concerned  with  the  truth  or  falsity  of  either  our  premises  or  our  conclusion 
but  are  concerned  only  with  the  validity  of  the  argument  leading  from  the 
one  to  the  other.  That  is,  we  insist  that  our  argument  be  formally  correct, 
or  that  the  involved  implication  be  true  regardless  of  the  truth  or  falsity  of 
the  premises  and  the  conclusion.  It  follows  that  the  implication  must  be  a 
tautologous  one.  Conversely,  if  the  implication  is  tautologous,  then  the 
argument  is  valid.  Thus  all  that  is  required  for  testing  our  reasoning  for 
validity  is  to  ascertain  whether  the  involved  implication  is  or  is  not  a 
tautology.  In  short,  propositional  formulas  that  are  tautologies  represent 
what  we  call  valid  argument. 
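To make the test for validity concrete, the sketch below searches for rows of the truth table in which every premise is true and the conclusion is false. An argument is valid exactly when no such row exists, which is the same as saying that the implication from the conjoined premises to the conclusion is a tautology. The two argument forms used as examples (modus tollens and the fallacy of affirming the consequent) and the function names are choices made here for illustration.

```python
from itertools import product


def implies(a, b):
    """Material implication: false only when a is true and b is false."""
    return (not a) or b


def counterexamples(premises, conclusion, n_vars):
    """Assignments that make every premise true and the conclusion false."""
    return [
        values
        for values in product([True, False], repeat=n_vars)
        if all(premise(*values) for premise in premises) and not conclusion(*values)
    ]


def is_valid(premises, conclusion, n_vars):
    """Valid means the implication (premises -> conclusion) is a tautology."""
    return not counterexamples(premises, conclusion, n_vars)


# From "p implies q" and "not q", conclude "not p".
modus_tollens = ([lambda p, q: implies(p, q), lambda p, q: not q], lambda p, q: not p)

# From "p implies q" and "q", conclude "p" (not a valid form).
affirming_consequent = ([lambda p, q: implies(p, q), lambda p, q: q], lambda p, q: p)

print(is_valid(*modus_tollens, 2))
print(counterexamples(*affirming_consequent, 2))
```

The second form fails for p false and q true, the one row in which both premises hold and the conclusion does not.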

In  view  of  the  above,  it  is  evident  that  a  primary  task  of  logic  is  to  ferret 
out tautologies, that is, to find those combinations of component
propositions p, q, r, ... which are true independently of the truth values of the
components  themselves.  These  tautologies  are  the  laws  of  the  logical  system, 
and  they  constitute  the  structure  of  what  is  to  be  regarded  as  correct  formal 
reasoning.  Now,  in  the  method  of  truth  tables,  we  possess  an  adequate 
and  sure  means  of  deciding  whether  a  composite  proposition  is  or  is  not  a 
tautology.  The  application  of  this  method  will,  however,  result  in  a  purely 
random  and  unsystematic  catalogue  of  tautologies.  It  is  the  purpose  of  the 
present  section  to  explain  a  method  in  which  the  tautologies  are  arrived  at 
in  an  orderly  and  interconnected  manner.  We  shall  show  that  from  a  small 
and  specially  selected  set  of  tautologies,  the  remaining  tautologies  can  be 
obtained  in  accordance  with  certain  precise  rules.  Furthermore,  the  new 
method  will  yield  only  tautologies,  and  we  shall  not  have  the  task  of  sorting 
out the tautologies from the nontautologies, as must be done in the
employment of the truth-table method. Since the new method obtains tautologies
by  a  procedure  of  symbolic  calculation,  the  new  method  is  referred  to  as 
the  calculus  of  propositions. 

The  approach  to  the  calculus  of  propositions  that  we  shall  now  explain 
is essentially that given by Whitehead and Russell in their Principia
mathematica. A remarkable feature of the development is that it employs the
postulational  method.  A  small  selection  is  made  from  the  set  of  all  tautologies 
to  serve  as  the  postulates  of  the  development,  and  then  several  formal  rules 
are  given  in  accordance  with  which  all  other  tautologies  can  be  obtained  from 
the  selected  few.  The  rules  play  the  same  role  in  the  development  of  the 
propositional  calculus  that  logical  inference  plays  in  the  usual  development 
of  a  branch  of  mathematics.  Of  course,  logical  inference,  in  the  common 
sense,  cannot  be  used  here,  for  this  very  logical  inference  now  constitutes 
the  object  of  our  study. 

We  shall  now  list,  and  briefly  comment  upon,  the  primitive  terms,  the 
postulates, and the rules for obtaining theorems in the calculus of
propositions.
Primitive terms

For the primitive, or undefined, terms of the calculus of propositions
we choose: (1) a set P of elements p, q, r, ..., called propositions, (2) a binary
operation performed on the elements of P and denoted by ∨, and (3) a
unary operation performed on the elements of P and denoted by '.

Because of the interdefinability of the logical connectives (pointed out
at the end of Section 9.1), there is no need also to include among the primitive
terms the binary operations ∧, →, and ↔. It is to be noted that if p and q
are members of P then p ∨ q and p' also are members of P; that is, they
also are propositions.

Postulates, or primitive tautologies

All tautologies are propositions, but a proposition may or may not be a
tautology. From among the infinitely many tautologies, four are selected to
serve as postulates, or as primitive tautologies. These primitive tautologies will
here be denoted by L1, L2, L3, L4, and for convenience in stating them
we precede them by a definition.

Definition 1. p → q means p' ∨ q.

L1: (p ∨ p) → p.

L2: q → (p ∨ q).

L3: (p ∨ q) → (q ∨ p).

L4: (q → r) → [(p ∨ q) → (p ∨ r)].

In Principia mathematica, Postulate L1 is called the principle of tautology.
It is clear that we could have asserted something stronger than just
implication, for actually (p ∨ p) ↔ p is a tautology. Since, however, p → (p ∨ p)
can be proved, this half of the equivalence need not be assumed.

Postulate L2 is motivated by the fact that a disjunction is true when one
of its components is true. In Principia mathematica, this postulate is called
the principle of addition.

Postulate L3, called the principle of permutation, is motivated by the
commutative nature of disjunction. Again, an equivalence could have been
asserted here instead of just an implication, but then more would be assumed
than is necessary.

Postulate L4 is called the principle of summation and is motivated by the
fact that in an implication a disjunct may be added to both the antecedent
and the consequent, and the implication will still hold. There is a similarity
between this postulate and the familiar law of arithmetic of natural numbers:
If a > b, then a + c > b + c.

Rules  for  obtaining  theorems  or  derived  tautologies 

There  are  four  rules  by  means  of  which  we  shall  be  permitted  to  obtain 
derived tautologies from given tautologies. These rules are
R1 (rule of substitution): We may substitute any proposition for all occurrences
of a proposition represented by a single letter in a given tautology, and thus obtain a
derived tautology.

For example, the substitution of the proposition p ∨ q for the
proposition q in the tautology L2 gives the new tautology

(p ∨ q) → [p ∨ (p ∨ q)].

R2 (rule of definitional substitution): We may replace any expression in a
given tautology by another expression which is definitionally identical with it, and
thus obtain a derived tautology.

For example, the replacement of q → r by q' ∨ r in the tautology L4
gives the new tautology

(q' ∨ r) → [(p ∨ q) → (p ∨ r)].

R3 (rule of detachment, or rule of implication): From two given tautologies
m and m → n, the derived tautology n may be obtained.

To facilitate the statement of the fourth rule we first introduce the
following definition:

Definition 2. p ∧ q means (p' ∨ q')'.

We now state Rule R4.*

R4 (rule of adjunction): From two given tautologies m and n, the derived
tautology m ∧ n may be obtained.

With  the  above  postulates  and  rules  of  procedure  in  mind,  we  now 
turn  to  some  of  the  theorems  of  the  calculus  of  propositions.  Only  enough 
demonstrations  will  be  presented  to  give  some  idea  of  the  workings  of  the 
calculus;  for  the  rest,  we  shall  be  content  with  merely  the  statements  of  the 
theorems. 


Theorem 1. (q → r) → [(p → q) → (p → r)].

Proof: (q → r) → [(p ∨ q) → (p ∨ r)]        (by Post. L4)
       (q → r) → [(p' ∨ q) → (p' ∨ r)]      (substituting p' for p)
       (q → r) → [(p → q) → (p → r)]        (by Def. 1)

(This theorem is one form of the transitivity of implication.)


Theorem 2. p → (p ∨ p).

Proof: q → (p ∨ q)        (by Post. L2)
       p → (p ∨ p)        (substituting p for q)

*Actually, Rule R4 can be established from the postulates and Rules R1, R2, R3,
but to carry out the establishment is a lengthy and involved task. Since we shall need
the rule we here simply assume it.
Theorem 3. p → p.

Proof: (q → r) → [(p → q) → (p → r)]                     (by Th. 1)
       [(p ∨ p) → p] → [{p → (p ∨ p)} → (p → p)]         (substituting p ∨ p for q and p for r)
       (p ∨ p) → p                                       (by Post. L1)
       {p → (p ∨ p)} → (p → p)                           (by Rule R3)
       p → (p ∨ p)                                       (by Th. 2)
       p → p                                             (by Rule R3)

(This theorem asserts that any proposition implies itself, or that
implication is reflexive.)
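Because Rules R1 and R3 are purely formal, the derivation of Theorem 3 can be carried out by symbol manipulation alone, with no appeal to truth values. The following Python sketch is one possible illustration and makes several assumptions that are not in the text: formulas are represented as nested tuples, the helper names are invented here, and Rule R2 is applied implicitly because imp(a, b) builds a' ∨ b directly, in accordance with Definition 1.

```python
def var(name):
    return ("var", name)


def neg(a):
    return ("not", a)


def disj(a, b):
    return ("or", a, b)


def imp(a, b):
    """Definition 1: a -> b means a' v b."""
    return disj(neg(a), b)


def substitute(formula, mapping):
    """Rule R1: replace all occurrences of the given letters simultaneously."""
    if formula[0] == "var":
        return mapping.get(formula[1], formula)
    return (formula[0],) + tuple(substitute(part, mapping) for part in formula[1:])


def detach(m, m_implies_n):
    """Rule R3: from m and m -> n, obtain n."""
    if m_implies_n[0] == "or" and m_implies_n[1] == neg(m):
        return m_implies_n[2]
    raise ValueError("second formula is not of the form m -> n")


p, q, r = var("p"), var("q"), var("r")

L1 = imp(disj(p, p), p)
L2 = imp(q, disj(p, q))
L4 = imp(imp(q, r), imp(disj(p, q), disj(p, r)))

theorem1 = substitute(L4, {"p": neg(p)})                    # substituting p' for p
theorem2 = substitute(L2, {"q": p})                         # substituting p for q

step = substitute(theorem1, {"q": disj(p, p), "r": p})      # substituting p v p for q, p for r
step = detach(L1, step)                                     # by Rule R3
theorem3 = detach(theorem2, step)                           # by Rule R3

print(theorem3 == imp(p, p))
```

Since imp(p, p) is the same tuple as p' ∨ p, the final formula is also the statement of Theorem 4, which is how Definition 1 is used in its proof.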


Theorem 4. p' ∨ p.

Proof: p → p         (by Th. 3)
       p' ∨ p        (by Def. 1)


Theorem 5. p ∨ p'.

Proof: (p ∨ q) → (q ∨ p)          (by Post. L3)
       (p' ∨ p) → (p ∨ p')        (substituting p' for p and p for q)
       p' ∨ p                     (by Th. 4)
       p ∨ p'                     (by Rule R3)

(This is the law of excluded middle, which says, “Either p is true or p is
false,” or “Either p is true or not-p is true.”)


Theorem 6. (p → p') → p'.

Proof: (p ∨ p) → p            (by Post. L1)
       (p' ∨ p') → p'         (substituting p' for p)
       (p → p') → p'          (by Def. 1)

(This theorem, which asserts that any proposition which implies its
own falsity is false, is the principle of reductio ad absurdum in its simplest form.
It is possible to prove more general forms of reductio ad absurdum, such as

(p → q) → [(p → q') → p'].

This latter theorem says that a proposition p which implies both q and q'
is false. Later we shall state the most general form of the principle of reductio
ad absurdum.)

Theorem 7. p → (p')'.

Proof: p ∨ p'              (by Th. 5)
       p' ∨ (p')'          (substituting p' for p)
       p → (p')'           (by Def. 1)


Theorem 8. p ∨ {(p')'}'.

Proof: (q → r) → [(p ∨ q) → (p ∨ r)]                      (by Post. L4)
       [p' → {(p')'}'] → [(p ∨ p') → (p ∨ {(p')'}')]      (substituting p' for q and {(p')'}' for r)
       p → (p')'                                          (by Th. 7)
       p' → {(p')'}'                                      (substituting p' for p)
       (p ∨ p') → (p ∨ {(p')'}')                          (by Rule R3)
       p ∨ p'                                             (by Th. 5)
       p ∨ {(p')'}'                                       (by Rule R3)


Theorem 9. (p')' → p.

Proof: (p ∨ q) → (q ∨ p)                        (by Post. L3)
       (p ∨ {(p')'}') → ({(p')'}' ∨ p)          (substituting {(p')'}' for q)
       p ∨ {(p')'}'                             (by Th. 8)
       {(p')'}' ∨ p                             (by Rule R3)
       (p')' → p                                (by Def. 1)


Theorem 10. [p → (p')'] ∧ [(p')' → p].

Proof: p → (p')'                        (by Th. 7)
       (p')' → p                        (by Th. 9)
       [p → (p')'] ∧ [(p')' → p]        (by Rule R4)

Definition 3. p ↔ q means (p → q) ∧ (q → p).

Theorem 11. p ↔ (p')'.

Proof: [p → (p')'] ∧ [(p')' → p]        (by Th. 10)
       p ↔ (p')'                        (by Def. 3)

(This is the law of double negation.)

Since the aim of the above demonstrations is merely to present some
idea of the nature of the calculus of propositions rather than to give a
complete account of its workings, we shall simply state the following
additional theorems:

Theorem 12. (p ∨ q) ↔ (p' ∧ q')' ↔ (p' → q).

Theorem 13. (p → q) ↔ (p' ∨ q) ↔ (p ∧ q')'.

Theorem 14. (p ∧ q) ↔ (p' ∨ q')' ↔ (p → q')'.

(Theorems 12, 13, and 14 show the logical equivalences between the
conjunctive, disjunctive, and implicative types of propositions. The
significance of these logical equivalences has already been discussed.)

Theorem 15. (p' → p) → p.

(This theorem sanctions a frequently used type of argument in which
a proposition p is proved by showing that the assumption of the falsity of
p implies p.)

Theorem 16. (p → q) ↔ (q' → p').

(This is the law of contraposition, and it permits us to prove p → q by
proving q' → p'.)

Theorem 17. [(p → q) ∧ q'] → p'.

(This theorem asserts that if p implies q, and q is false, then p is false.)
Theorem 18. [(p ∨ q) ∧ q'] → p.

(This theorem asserts that if p or q is true, and q is false, then p is true.)

Theorem 19. [(p → q) ∧ (q → r)] → (p → r).

(This is the law of the syllogism and is another form of the transitivity
of implication.)

Theorem 20. [p ∧ {[(p ∧ q') → r] ∧ [(p ∧ q') → r']}] → q.

(This theorem, which asserts that if p is true, and if p and q' imply both
r and r', then q is true, is the principle of reductio ad absurdum in its most
general form. It is frequently used in mathematical demonstrations, where p
represents the entire set of postulates and q is the theorem we wish to prove
by reductio ad absurdum. We assume q', or that q is false. The proof then
consists in showing that the original postulates together with the falsity
of q imply a pair of contradictory propositions, r and r'. When this is
established, we may accept q as true.)

Theorem 21. q → (p → q).

Theorem 22. p' → (p → q).

(Theorems 21 and 22 exhibit two features of material implication that
we have considered earlier. Thus Theorem 21 asserts that a true proposition
q is implied by any proposition p, and Theorem 22 asserts that a false
proposition p implies any proposition q.)

Theorem 23. (p ∧ p')'.

(This is the law of contradiction, which asserts that it is false that both
p and not-p are true.)

The  above  theorems  are  sufficient  to  illustrate  the  general  nature  of  the 
calculus  of  propositions,  and  it  should  now  be  seen  how  the  principles  of 
the  argument  of  compound  propositions  are  derivable  from  this  calculus. 
But  though  the  calculus  of  propositions  is  adequate  for  the  precise  rendering 
of  those  logical  connections  in  which  the  propositions  appear  as  unanalyzed 
wholes,  it  is  clear  that  the  calculus  is  not  adequate  for  the  purposes  of  logic 
in  general,  for  there  are  logical  inferences  of  the  sort  that  depend  not  only 
upon  the  propositions  as  wholes,  but  also  upon  the  inner  content  of  the 
propositions  themselves.  For  example,  the  calculus  of  propositions  cannot 
render  the  logical  relation  expressed  in  the  classical  syllogism: 

All  men  are  mortal. 

Socrates  is  a  man. 

Therefore  Socrates  is  mortal. 

The  reason  is  evident;  the  inference  made  in  this  syllogism  depends  upon 
the  relations  of  subject  to  predicate  in  the  various  sentences  of  the  syllogism, 
and  not  just  upon  the  propositions  considered  as  unanalyzed  wholes.  The 
calculus of classes must next be introduced to take care of this type of logical
structure.  It  would  unduly  lengthen  our  treatment  were  we  to  attempt  here 
the  detailed  introduction  and  unfolding  of  this  phase  of  logic,  though  we 
have  given  some  idea  of  the  calculus  of  classes  in  an  earlier  chapter.  We 
shall  accordingly  drop  our  discussion  of  the  development  of  symbolic  logic 
at  this  point,  and  finish  the  section  with  a  few  comments  bearing  on  some 
of  the  matters  already  covered. 

In the first edition of Principia mathematica, Whitehead and Russell
employed five postulates for the development of the calculus of propositions.
In addition to Postulates L1, L2, L3, L4 given above, they also employed
the postulate

[p ∨ (q ∨ r)] → [q ∨ (p ∨ r)].


Later, in 1926, P. Bernays* showed that this additional postulate is not
independent of the other four, but that it can, by the permitted rules, be
derived from L1, L2, L3, L4. He also showed, on the other hand, that L1,
L2, L3, L4 constitute an independent set of postulates.

As  with  other  systems  developed  via  the  postulational  method,  the 
calculus  of  propositions  can  be  evolved  from  more  than  one  set  of  primitive 
terms  and  from  more  than  one  set  of  postulates.  J.  B.  Rosser  and  W.  V. 
Quine,  for  example,  have  given  developments  based  upon  conjunction  and 
negation  as  primitive  operations.  Under  these  circumstances  the  postulate 
set  may  now  be  taken  as 


Def. p → q means (p ∧ q')'.

L'1: p → (p ∧ p).

L'2: (p ∧ q) → p.

L'3: (p → q) → [(q ∧ r)' → (r ∧ p)'].

Rules R1, R2, R3, and R4 are still used for deriving the theorems. As far
back as 1879, Frege chose implication and negation as primitive operations
and offered a set of six postulates in terms of these operations. J. Łukasiewicz
has shown that Frege’s set of six postulates may be replaced by the following
simpler system, consisting of only three postulates:

p → (q → p).

[p → (q → r)] → [(p → q) → (p → r)].

(p' → q') → (q → p).

Again, Rules R1, R2, R3, R4 are used for deriving theorems. In 1916,
J. Nicod† showed that the calculus of propositions follows from Sheffer’s
stroke as a primitive operation, together with the single postulate

[p/(q/r)]/{[s/(s/s)]/[(t/q)/{(p/t)/(p/t)}]}.

*P. Bernays [1] (Bibliography).
†J. Nicod [1].
The rule of implication is now replaced by the rule: From two given tautologies
u and u/(v/w), the derived tautology w may be obtained. Other postulational
approaches to the calculus of propositions, besides those we have mentioned,
have been offered.
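Whatever postulate set is chosen, each postulate must itself be a tautology, and this much can be confirmed by truth tables. The sketch below is an illustration only and says nothing about the independence or completeness of any set; the helper names are chosen here, and Nicod's postulate is transcribed from the form given above with the stroke read as “not p or not q.”

```python
from inspect import signature
from itertools import product


def implies(a, b):
    return (not a) or b


def stroke(a, b):
    return (not a) or (not b)


def is_tautology(formula):
    """True if the formula holds for every assignment to its letters."""
    n = len(signature(formula).parameters)
    return all(formula(*values) for values in product([True, False], repeat=n))


POSTULATES = {
    # the fifth postulate of the first edition of Principia mathematica
    "PM fifth": lambda p, q, r: implies(p or (q or r), q or (p or r)),
    # the set based on conjunction and negation
    "L'1": lambda p: implies(p, p and p),
    "L'2": lambda p, q: implies(p and q, p),
    "L'3": lambda p, q, r: implies(implies(p, q), implies(not (q and r), not (r and p))),
    # the three postulates based on implication and negation
    "Luk 1": lambda p, q: implies(p, implies(q, p)),
    "Luk 2": lambda p, q, r: implies(implies(p, implies(q, r)),
                                     implies(implies(p, q), implies(p, r))),
    "Luk 3": lambda p, q: implies(implies(not p, not q), implies(q, p)),
    # Nicod's single postulate in terms of the stroke
    "Nicod": lambda p, q, r, s, t: stroke(
        stroke(p, stroke(q, r)),
        stroke(stroke(s, stroke(s, s)),
               stroke(stroke(t, q), stroke(stroke(p, t), stroke(p, t)))),
    ),
}

for name, postulate in POSTULATES.items():
    print(name, is_tautology(postulate))
```

The modified rule of detachment can be read the same way: u/(v/w) has the same truth table as u → (v ∧ w), so w is true whenever u and u/(v/w) are both true.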

A remarkable feature of the calculus of propositions is that it actually
is an example of a Boolean algebra. To see this, let us interpret the elements
a, b, c, ... of a Boolean algebra to mean propositions, let ∪ and ∩ mean
∨ and ∧, let equality mean logical equivalence, and let u and z mean a
tautologous proposition and the negative of a tautologous proposition,
respectively. Under these interpretations, the postulates of Boolean algebra
(see Section 8.2) become the following theorems in the calculus of
propositions:
The reader may easily verify, by using the method of truth tables, for
example, that these are indeed theorems in the calculus of propositions, that is,
are  tautologies.  Since,  for  our  interpretations,  the  postulates  of  Boolean 
algebra  become  tautologies,  it  follows  that  the  calculus  of  propositions  can 
be  considered  an  example  of  a  Boolean  algebra. 

This  representation  of  the  calculus  of  propositions  as  a  Boolean  algebra 
is  of  great  importance.  It  means  that  any  theorem  of  Boolean  algebra  leads 
to  a  corresponding  theorem  in  the  calculus  of  propositions.  For  example, 
the  De  Morgan  laws  of  Boolean  algebra, 

(a ∪ b)' = a' ∩ b',     (a ∩ b)' = a' ∪ b',

lead to the following theorems in the calculus of propositions:

(a ∨ b)' ↔ (a' ∧ b'),     (a ∧ b)' ↔ (a' ∨ b').

Again, since there is a principle of duality in Boolean algebra, there is a
corresponding principle of duality in the calculus of propositions; it is:
Any tautology of the form m ↔ n, where m and n are expressed only by means of
conjunctions, disjunctions, and negations, remains a tautology if the conjunctions and
disjunctions are interchanged throughout the tautology.
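The principle of duality can be tried out on particular tautologies. In the following Python sketch, offered as an illustration with invented helper names and a tuple representation assumed here, a formula built from conjunctions, disjunctions, and negations is dualized by interchanging "and" and "or", and both the original equivalence and its dual are checked by truth tables. The distributive law used as a second example is a standard one supplied here.

```python
from itertools import product


def evaluate(formula, env):
    """Truth value of a tuple-encoded formula under the assignment env."""
    op = formula[0]
    if op == "var":
        return env[formula[1]]
    if op == "not":
        return not evaluate(formula[1], env)
    if op == "and":
        return evaluate(formula[1], env) and evaluate(formula[2], env)
    if op == "or":
        return evaluate(formula[1], env) or evaluate(formula[2], env)
    raise ValueError(op)


def letters(formula):
    if formula[0] == "var":
        return {formula[1]}
    return set().union(*(letters(part) for part in formula[1:]))


def equivalent(m, n):
    """True if m <-> n is a tautology."""
    names = sorted(letters(m) | letters(n))
    return all(
        evaluate(m, dict(zip(names, values))) == evaluate(n, dict(zip(names, values)))
        for values in product([True, False], repeat=len(names))
    )


def dual(formula):
    """Interchange conjunctions and disjunctions throughout."""
    if formula[0] == "var":
        return formula
    swapped = {"and": "or", "or": "and", "not": "not"}[formula[0]]
    return (swapped,) + tuple(dual(part) for part in formula[1:])


a, b, c = ("var", "a"), ("var", "b"), ("var", "c")

# (a v b)' <-> (a' ^ b'); its dual is the other De Morgan law.
de_morgan = (("not", ("or", a, b)), ("and", ("not", a), ("not", b)))
# a ^ (b v c) <-> (a ^ b) v (a ^ c); its dual is the other distributive law.
distributive = (("and", a, ("or", b, c)), ("or", ("and", a, b), ("and", a, c)))

for m, n in (de_morgan, distributive):
    print(equivalent(m, n), equivalent(dual(m), dual(n)))
```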

In view of the above fact that the calculus of propositions is an
interpretation of Boolean algebra, one might wonder if the calculus of propositions
can  be  derived  from  the  calculus  of  classes.  The  answer  to  this  is  “no,”  for 
the  calculus  of  classes  requires  the  logical  concepts  of  implication,  negation, 
and  so  on,  for  its  development.  Indeed,  the  calculus  of  propositions  cannot 
be  derived  from  any  other  system,  for  in  order  to  set  up  any  other  system 
we need the properties of propositions. That is, the properties of propositions
must be assumed first and thus cannot be derived from any other system.
It  follows  that  the  calculus  of  propositions  occupies  a  unique  position  among 
deductive  systems  in  that  it  must  be  assumed  as  fundamental.  The  calculus 
of  propositions  constitutes  the  central  core  of  all  work  involving  deduction. 

9.3  Other  Logics 

An interesting analogy (if it is not pushed too far) exists between the
parallelogram law of forces and the postulational method. By the parallelogram
law, two component forces are combined into a single resultant force.
Different resultant forces are obtained by varying one or both of the component
forces, although it is possible to obtain the same resultant force by
taking different pairs of initial component forces. Now, just as the resultant
force is determined by the two initial component forces, so (see Figure 40)
is a mathematical theory determined by a set of postulates and a logic.
That  is,  the  set  of  statements  constituting  a  mathematical  theory  results  from 
the  interplay  of  an  initial  set  of  statements,  called  the  postulates,  and  another 
initial  set  of  statements,  called  the  logic  or  the  rules  of  procedure.  For  some 
time  mathematicians  have  been  aware  of  the  variability  of  the  first  set  of 
initial  statements,  namely,  the  postulates,  but  until  very  recent  times  the 
second  set  of  initial  statements,  namely,  the  logic,  was  universally  thought 
to  be  fixed,  absolute,  and  immutable.  Indeed,  this  is  still  the  prevailing  view 
among  most  people,  for  it  seems  quite  inconceivable,  except  to  the  very  few 
students  of  the  subject,  that  there  could  be  any  alternative  to  the  laws  of 
logic stated by Aristotle in the fourth century B.C. The general feeling is that
these  laws  are  in  some  way  attributes  of  the  structure  of  the  universe  and 
that  they  are  inherent  in  the  very  nature  of  human  reasoning.  As  with  many 
other  absolutes  of  the  past,  this  one,  too,  has  toppled,  but  only  as  late  as 
1921.  The  modern  viewpoint  can  hardly  be  more  neatly  put  than  in  the 
following words of the outstanding American logician, Alonzo Church.*

*A. Church [3], pp. 348, 349.

We do not attach any character of uniqueness or absolute truth to any
particular system of logic. The entities of formal logic are abstractions, invented
because of their use in describing and systematizing facts of experience or observation,
and their properties, determined in rough outline by this intended use,
depend  for  their  exact  character  on  the  arbitrary  choice  of  the  inventor.  We  may 
draw  the  analogy  of  a  three  dimensional  geometry  used  in  describing  physical 
space,  a  case  for  which,  we  believe,  the  presence  of  such  a  situation  is  more 
commonly recognized. The entities of the geometry are clearly of abstract
character,  numbering  as  they  do  planes  without  thickness  and  points  which 
cover  no  area  in  the  plane,  point  sets  containing  an  infinitude  of  points,  lines  of 
infinite  length,  and  other  things  which  cannot  be  reproduced  in  any  physical 
experiment.  Nevertheless  the  geometry  can  be  applied  to  physical  space  in  such 
a  way  that  an  extremely  useful  correspondence  is  set  up  between  the  theorems 
of  the  geometry  and  observable  facts  about  material  bodies  in  space.  In  building 
the  geometry,  the  proposed  application  to  physical  space  serves  as  a  rough  guide 
in  determining  what  properties  the  abstract  entities  shall  have,  but  does  not 
assign  these  properties  completely.  Consequently  there  may  be,  and  actually 
are,  more  than  one  geometry  whose  use  is  feasible  in  describing  physical  space. 
Similarly,  there  exist,  undoubtedly,  more  than  one  formal  system  whose  use  as 
a  logic  is  feasible,  and  of  these  systems  one  may  be  more  pleasing  or  more 
convenient  than  another,  but  it  cannot  be  said  that  one  is  right  and  the  other 
wrong. 

It  will  be  recalled  that  new  geometries  first  came  about  through  the 
denial  of  Euclid’s  parallel  postulate,  and  that  new  algebras  first  came  about 
through  the  denial  of  the  commutative  law  of  multiplication.  In  similar 
fashion,  the  new  so-called  “many-valued  logics”  first  came  about  by  denying 
Aristotle’s  law  of  excluded  middle.  According  to  this  law,  the  disjunctive 
proposition $p \vee p'$ is a tautology, and a proposition p in Aristotelian logic
is always either true or false. Because a proposition may possess either
of two possible truth values, namely truth or falsity, this logic is known as
a  two-valued  logic.  In  1921,  in  a  short  two-page  paper,  J.  Lukasiewicz 
considered  a  three-valued  logic,  or  a  logic  in  which  a  proposition  p  may 
possess  any  one  of  three  possible  truth  values.  Very  shortly  after,  and 
independently  of  Lukasiewicz’s  work,  E.  L.  Post  considered  m-valued 
logics,  in  which  a  proposition  p  may  possess  any  one  of  m  possible  truth 
values, where m is any integer greater than 1. If m exceeds 2, the logic is
said to be many-valued. Another study of m-valued logics was given in 1930
by  Lukasiewicz  and  A.  Tarski.  Then,  in  1932,  the  m-valued  truth  systems 
were  extended  by  H.  Reichenbach  to  an  infinite-valued  logic,  in  which  a 
proposition  p  may  assume  any  one  of  infinitely  many  possible  values. 

*As  a  matter  of  historical  interest,  in  1936  K.  Michalski  discovered  that  three-valued 
logics  had  actually  been  anticipated  as  early  as  the  fourteenth  century  by  the  medieval 
schoolman,  William  of  Occam.  The  possibility  of  a  three-valued  logic  had  also  been 
considered by the philosopher Hegel and, in 1896, by Hugh MacColl. These speculations,
however, had little effect on subsequent thought and so cannot be considered
as decisive contributions.

Not all new logics are of the type just discussed. Thus A. Heyting has
developed a symbolic two-valued logic to serve the intuitionist school of
mathematicians; it differs from Aristotelian logic in that it does not universally
accept the law of excluded middle or the law of double negation. Like
the  many-valued  logics,  then,  this  special-purpose  logic  exhibits  differences 
from  Aristotelian  laws.  Such  logics  are  known  as  non-Aristotelian  logics. 
As a matter of fact, the symbolic two-valued logic of Principia mathematica,
which  was  described  in  the  last  section,  is  really  non-Aristotelian;  it  differs 
from  Aristotelian  logic  in  its  meaning  of  implication. 

Like  the  non-Euclidean  geometries,  the  non-Aristotelian  logics  have 
proved  not  to  be  barren  of  application.  Reichenbach  actually  devised  his 
infinite-valued  logic  to  serve  as  a  basis  for  the  mathematical  theory  of 
probability.  And  in  1933  F.  Zwicky  observed  that  many-valued  logics  can 
be  applied  to  the  quantum  theory  of  modern  physics.  Many  of  the  details 
of  such  an  application  have  been  supplied  by  Garrett  Birkhoff,  J.  von 
Neumann,  and  H.  Reichenbach.  Lukasiewicz  has  employed  three-valued 
logics  to  establish  the  independence  of  the  postulates  of  familiar  two-valued 
logic.*  The  part  that  non-Aristotelian  logics  may  play  in  the  future 
development  of  mathematics  is  uncertain  but  intriguing  to  contemplate; 
the  application  of  Heyting’s  symbolic  logic  to  intuitionistic  mathematics 
indicates  that  the  new  logics  may  be  mathematically  valuable.  In  the  next 
section  we  shall  point  out  a  possible  use  of  these  logics  in  the  resolution  of 
a  modern  crisis  in  the  foundations  of  mathematics. 

Anything  like  a  full  treatment  of  many-valued  logics  would  not  be 
feasible  here,  but  perhaps  it  is  possible  to  give  some  idea  of  what  such  a  logic 
is  like.  For  simplicity  we  shall  consider  a  three-valued  logic  and  shall  attempt 
to  build  up  the  logic  by  generalizing  some  of  the  concepts  of  the  familiar 
two-valued  logic  considered  in  the  last  section. 

We  shall  employ  the  method  of  truth  tables,  starting  with  a  truth  table 
for  conjunction.  We  first  of  all  repeat  the  truth  table  for  conjunction  for  the 
two-valued  logic,  displaying  the  table  in  the  more  convenient  form  of  a 
multiplication table as shown in Figure 41. This table is to be read as follows.
Down the left-hand column appear the possible truth values for proposition
p, and across the top row appear the possible truth values for the proposition
q. Now, knowing the truth value of p and of q, one can find the truth value
of $p \wedge q$ by looking in the table to the right of the value of p and below the
value of q. Since, by definition, $p \wedge q$ is to be true when and only when both
p and q are true, a T appears in the top left box of the table and F’s appear
in all the other boxes. It is to be noted that the table is uniquely determined
by the definition of $p \wedge q$.

*See J. Lukasiewicz.

We  now  proceed  to  the  three-valued  logic  and  again  agree  to  take  the 
conjunction $p \wedge q$ to be true when and only when both p and q are true.
Denoting the three possible truth values of a proposition by T, ?, and F,
we begin to construct a truth table as in Figure 42. By our agreement
concerning the meaning of $p \wedge q$, the top left box in the table must contain
a T, and no other box in the table is allowed to contain a T. Since there are
eight remaining boxes and each may be filled in either of two possible ways,
namely, with either an F or a ?, there are altogether $2^8 = 256$ possible ways
of filling the eight boxes. It follows that there are 256 different ways of
developing a truth table for conjunction in a three-valued logic!
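
The count of 256 can be confirmed by brute force. The sketch below assumes nothing beyond the agreement just stated: the box for (T, T) holds a T, and each of the other eight boxes holds either ? or F. The variable names are ours.

```python
from itertools import product

VALUES = ["T", "?", "F"]

# The eight boxes other than the top left one.
cells = [(p, q) for p in VALUES for q in VALUES if (p, q) != ("T", "T")]

tables = []
for filling in product(["?", "F"], repeat=len(cells)):
    table = {("T", "T"): "T"}          # fixed by the definition of conjunction
    table.update(zip(cells, filling))  # one choice of ? or F per remaining box
    tables.append(table)

print(len(cells), len(tables))  # 8 256
```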

In each of the following tables the rows give the truth value of p and the columns the truth value of q.

$$\begin{array}{c|ccc}
\wedge & T & ? & F \\ \hline
T & T &   &   \\
? &   &   &   \\
F &   &   &
\end{array}$$

FIG. 42

$$\begin{array}{c|ccc}
\wedge & T & ? & F \\ \hline
T & T & ? & F \\
? & ? & ? & F \\
F & F & F & F
\end{array}$$

FIG. 43a

$$\begin{array}{c|ccc}
\wedge & T & ? & F \\ \hline
T & T & ? & F \\
? & ? & ? & ? \\
F & F & ? & F
\end{array}$$

FIG. 43b

Figures 43a and 43b illustrate two of the possible 256 truth tables for
conjunction in a three-valued logic. The truth table of Figure 43a is the one
favored by Lukasiewicz, Post, and J. B. Rosser, and is formed by agreeing
that ? is falser than T, F is falser than ?, and $p \wedge q$ is exactly as true as the
falser of its two factors. The truth table in Figure 43b is favored by D. A.
Bocvar, and is obtained by considering ? to mean undecidable and by agreeing
that $p \wedge q$ is undecidable when and only when either p or q is undecidable.
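
The two verbal rules just given determine their tables completely, and each can be written as a short function. This is a sketch under the stated agreements only; the names `conj_43a`, `conj_43b`, and `RANK` are our own labels.

```python
VALUES = ["T", "?", "F"]
RANK = {"T": 2, "?": 1, "F": 0}  # a larger rank means "truer"

def conj_43a(p, q):
    """The conjunction is exactly as true as the falser of its two factors."""
    return p if RANK[p] <= RANK[q] else q

def conj_43b(p, q):
    """The conjunction is undecidable exactly when either factor is."""
    if "?" in (p, q):
        return "?"
    return "T" if (p, q) == ("T", "T") else "F"

def show(conj):
    """Print the table row by row (rows: p, columns: q)."""
    for p in VALUES:
        print(p, "|", " ".join(conj(p, q) for q in VALUES))

show(conj_43a)
show(conj_43b)

# The boxes in which the two tables disagree.
print([(p, q) for p in VALUES for q in VALUES
       if conj_43a(p, q) != conj_43b(p, q)])
```

The two rules disagree only where one factor is ? and the other is F.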

Let  us  now  consider  a  truth  table  for  negation.  Our  only  restriction  will 
be that when p is true, p' must fail to be true, and when p is false, p' must
fail to be false. Whereas this restriction completely determines the truth
table for negation in a two-valued logic, it permits twelve possible tables
in a three-valued logic. Figures 44a and 44b illustrate two of these possible
tables. The truth table in Figure 44a is favored by Post and is based on the
assumption that, since in a two-valued logic (p')' is the same as p, in a three-valued
logic we ought to have [(p')']' the same as p. The truth table in Figure
44b is favored by Bocvar, Lukasiewicz, and Rosser, and is based on the
assumption that (p')' is the same as p.
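
The twelve admissible negation tables can be listed mechanically, and the two assumptions just described can be tested against them. The sketch below enumerates every assignment of p' to the three values subject only to the stated restriction; it does not reproduce Figures 44a and 44b themselves, and the names `involutive` and `cyclic` are ours.

```python
from itertools import product

VALUES = ["T", "?", "F"]

# Every table p -> p' in which T' is not T and F' is not F.
negations = [
    dict(zip(VALUES, images))
    for images in product(VALUES, repeat=3)
    if images[0] != "T" and images[2] != "F"
]
print(len(negations))  # 12

# Tables in which (p')' is the same as p.
involutive = [n for n in negations if all(n[n[p]] == p for p in VALUES)]

# Tables in which [(p')']' is the same as p although (p')' is not always p.
cyclic = [
    n for n in negations
    if all(n[n[n[p]]] == p for p in VALUES)
    and any(n[n[p]] != p for p in VALUES)
]

print(involutive)   # [{'T': 'F', '?': '?', 'F': 'T'}]
print(len(cyclic))  # 2
```

Exactly one of the twelve tables satisfies (p')' = p, so that assumption alone fixes the table; the triple-negation assumption leaves two candidates, which differ in the direction in which the three values are cycled.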

Truth tables for the other logical connectives can now be formed by
defining  these  other  connectives  in  terms  of  conjunction  and  negation.  Since 
conjunction  and  negation  are  independent  operations,  whereas,  as  we  have 
observed,  the  other  operations  may  be  made  to  depend  upon  these  two,  it 
follows  that  there  are  altogether  (256)  (12)  =  3072  possible  three-valued 
logics.  A  many-valued  logic  apparently  possesses  an  enormous  wealth  of 
possible  structure. 
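
As one illustration of a derived connective, the sketch below builds disjunction from conjunction and negation. It assumes the De Morgan style definition of p or q as (p' and q')', together with the "falser factor" conjunction and the negation satisfying (p')' = p; other choices among the 3072 would give other tables.

```python
VALUES = ["T", "?", "F"]
RANK = {"T": 2, "?": 1, "F": 0}
NEG = {"T": "F", "?": "?", "F": "T"}  # the negation with (p')' = p

def conj(p, q):
    """Conjunction: as true as the falser of its two factors."""
    return p if RANK[p] <= RANK[q] else q

def disj(p, q):
    """Assumed definition: p or q := (p' and q')'."""
    return NEG[conj(NEG[p], NEG[q])]

# Truth value of "p or p'" for each value of p.
excluded_middle = {p: disj(p, NEG[p]) for p in VALUES}
print(excluded_middle)  # {'T': 'T', '?': '?', 'F': 'T'}

print(256 * 12)  # 3072
```

Under these choices the law of excluded middle takes the value ? when p does, so it is no longer a tautology.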

In some treatments of many-valued logics, each proposition p has associated
with it a real number, $T(p)$, lying in the range from 0 to 1, inclusive,
and called its truth value. The truth value of a proposition under such
circumstances may be interpreted to mean the probability that the proposition
is true, and two propositions having the same truth values can be
considered to be logically equivalent. The truth value 1 means truth; the
truth value 0 means falsity. Negation may be defined in terms of truth values
by $T(p') = 1 - T(p)$. Thus, in a three-valued logic, the truth values may
be taken as 0, 1/2, and 1. If p has the truth value 1/2, then p' also has the truth
value 1/2, and p is logically equivalent to its own negation. Such a proposition
may be called doubtful, and its negation is then also doubtful. This approach
shows the intimate connection between many-valued logics and the theory
of probability.
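
The numerical rule for negation is easy to try out. The sketch below uses exact fractions for the three truth values 0, 1/2, and 1; the function name `neg` is ours.

```python
from fractions import Fraction

def neg(t):
    """Truth value of p' given the truth value t of p, by the rule 1 - t."""
    return 1 - t

three_valued = [Fraction(0), Fraction(1, 2), Fraction(1)]
for t in three_valued:
    print(t, "->", neg(t))

# The "doubtful" values: those equal to the value of their own negation.
doubtful = [t for t in three_valued if neg(t) == t]
print(doubtful)  # [Fraction(1, 2)]
```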

Perhaps  the  above  fragmentary  treatment  is  sufficient  to  give  the  reader 
some  idea  of  the  nature  of  non-Aristotelian  logics.  From  it  all  there  emerges 
a remarkable principle of discovery and advancement, namely, the constructive
doubting of a traditional belief. When Einstein was asked how he came
to  invent  the  theory  of  relativity  he  replied,  “By  challenging  an  axiom.” 
Lobachevsky  and  Bolyai  challenged  Euclid’s  axiom  of  parallels;  Hamilton 
and  Cayley  challenged  the  axiom  that  multiplication  is  commutative; 
Lukasiewicz  and  Post  challenged  Aristotle’s  axiom  of  the  excluded  middle. 
Similarly,  in  the  field  of  science,  Copernicus  challenged  the  axiom  that  the 
earth  is  the  center  of  the  solar  system;  Galileo  challenged  the  axiom  that 
the  heavier  body  falls  the  faster;  Einstein  challenged  the  axiom  that  of  two 
distinct  instants  one  must  precede  the  other.  This  constructive  challenging 
of  axioms  has  become  one  of  the  commoner  ways  of  making  advances  in 
mathematics,  and  it  undoubtedly  lies  at  the  heart  of  Georg  Cantor’s  famous 
aphorism:  “The  essence  of  mathematics  lies  in  its  freedom.” 

9.4  Crises  in  the  Foundations  of  Mathematics

A  study  of  the  history  of  mathematics  from  Greek  antiquity  to  the 
present  reveals  that  the  foundations  of  mathematics  have  undergone  three 
profoundly  disturbing  crises. 

The first crisis in the foundations of mathematics arose in the fifth
century B.C., and indeed, such a crisis could not have occurred much earlier,
for, as we have seen, mathematics as a deductive study originated not earlier
than the sixth century B.C., perhaps with Thales, Pythagoras, and their
pupils. The crisis was precipitated by the unexpected discovery that not all
geometrical magnitudes of the same kind are commensurable with one
another; it was shown, for example, that the diagonal and side of a square
contain no common unit of measure. Since the Pythagorean development of
magnitudes was built upon the firm intuitive belief that all like magnitudes
are commensurable, the discovery that like magnitudes may be incommensurable
proved to be highly devastating. For instance, the entire Pythagorean
theory of proportion with all of its consequences had to be scrapped
as unsound. The resolution of this first crisis in the foundations of mathematics
was neither easily nor quickly realized. It was finally achieved in
about 370 B.C. by the brilliant Eudoxus, whose revised theory of magnitude
and proportion is one of the great mathematical masterpieces of all time.
Eudoxus’ remarkable treatment of incommensurables is found in the fifth
book of Euclid’s Elements;* it coincides essentially with the modern exposition
of irrational numbers that was given by Richard Dedekind in 1872. We
considered this first crisis in the foundations of mathematics in Section 1.5,
where, among other things, we commented upon its possible relation to the
formulation of the axiomatic method.

The  second  crisis  in  the  foundations  of  mathematics  followed  the 
discovery of the calculus by Newton and Leibniz in the late seventeenth
century.† We have seen how the successors of these men, intoxicated by the
power and applicability of the new tool, failed to consider sufficiently the
solidity of the base upon which the subject was founded, so that instead of
having demonstrations justify results, results were used to justify demonstrations.
With the passage of time, contradictions and paradoxes arose in
increasing  numbers,  and  a  serious  crisis  in  the  foundations  of  mathematics 
became  evident.  It  was  realized  more  and  more  that  the  edifice  of  analysis 
was  being  built  upon  sand,  and  finally,  in  the  early  nineteenth  century, 
Cauchy  took  the  first  steps  toward  resolving  the  crisis  by  replacing  the  hazy 
method  of  infinitesimals  by  the  precise  method  of  limits.  With  the  subsequent 
so-called  arithmetization  of  analysis  by  Weierstrass  and  his  followers,  it  was 
felt  that  the  second  crisis  in  the  foundations  of  mathematics  had  been 
overcome,  and  that  the  whole  structure  of  mathematics  had  been  redeemed 
and  placed  upon  an  unimpeachable  base.  The  origin  and  resolution  of  this 
second  crisis  in  the  foundations  of  mathematics  constituted  the  subject  matter 
of  Chapter  7. 

The  third  crisis  in  the  foundations  of  mathematics  materialized  with 
shocking  suddenness  in  1897,  and,  though  now  well  over  half  a  century  old, 
is  still  not  resolved  to  the  satisfaction  of  all  concerned.  The  crisis  was  brought 
about  by  the  discovery  of  paradoxes  or  antinomies  in  the  fringe  of  Cantor’s 
general  theory  of  sets.  Since  so  much  of  mathematics  is  permeated  with  set 
concepts  and,  for  that  matter,  can  actually  be  made  to  rest  upon  set  theory 
as  a  foundation,  the  discovery  of  paradoxes  in  set  theory  naturally  cast  into 
doubt  the  validity  of  the  whole  foundational  structure  of  mathematics. 

In  1897  the  Italian  mathematician,  Burali-Forti,  brought  to  light  the 
first publicized paradox of set theory.‡ As originally conceived and stated
by  Burali-Forti,  the  paradox  involves  technical  terms  and  ideas  which,  in 
our  limited  treatment  of  sets  in  Chapter  8,  we  lacked  space  to  develop.  The 
essence  of  the  paradox  can  be  given,  however,  by  a  nontechnical  description 
of  a  very  similar  paradox  found  by  Cantor  two  years  later.  In  his  theory 
of sets, Cantor had succeeded in proving that for any given transfinite
number there is always a greater transfinite number, so that just as there is
no greatest natural number, there also is no greatest transfinite number.
Now consider the set whose members are all possible sets. Surely no set
can have more members than this set of all sets. But if this is the case, how
can there be a transfinite number greater than the transfinite number of
this set?

*See Appendix, Section A.6.

†Forewarnings of this crisis can be seen in the renowned paradoxes of Zeno of about
450 B.C.

‡C. Burali-Forti.
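
Cantor's result that there is always a greater number rests on comparing a set with the set of all its subsets. The text does not give the argument, so the following is only a finite illustration of the standard diagonal construction, supplied here as an assumption about how the proof runs: for every way of assigning a subset f(x) to each member x of a set S, the subset D of those x not belonging to f(x) is never among the assigned subsets.

```python
from itertools import combinations, product

def power_set(s):
    """All subsets of s, as frozensets."""
    items = sorted(s)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]

S = {0, 1, 2}
subsets = power_set(S)
print(len(S), len(subsets))  # 3 8

# Try every assignment f of a subset to each member of S.  The diagonal set
# D = {x in S : x not in f(x)} differs from f(x) at x, so f never reaches it.
elements = sorted(S)
missed_every_time = all(
    frozenset(x for x, fx in zip(elements, images) if x not in fx) not in images
    for images in product(subsets, repeat=len(elements))
)
print(missed_every_time)  # True
```

The same construction applied to a "set of all sets" is what produces the difficulty described above, since every subset of that set would itself have to be one of its members.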

Whereas  the  Burali-Forti  and  Cantor  paradoxes  involve  results  of  set 
theory,  Bertrand  Russell  discovered  in  1902  a  paradox  depending  on  nothing 
more  than  just  the  concept  of  set  itself.  Before  describing  the  Russell  paradox, 
we  note  that  sets  either  are  members  of  themselves  or  are  not  members  of 
themselves.  Thus  the  set  of  all  abstract  ideas  is  itself  an  abstract  idea,  but 
the  set  of  all  men  is  not  a  man.  Again,  the  set  of  all  sets  is  itself  a  set,  but 
the  set  of  all  stars  is  not  a  star.  Let  us  represent  the  set  of  all  sets  which 
are members of themselves by M, and the set of all sets which are not
members of themselves by N. We now ask ourselves whether set N is or
is not a member of itself. If N is a member of itself, then N is a member
of M and not of N, and N is not a member of itself. On the other hand, if
N is not a member of itself, then N is a member of N and not of M, and N
is a member of itself. The paradox lies in the fact that in either case we are
led  to  a  contradiction. 

A  more  compact  and  less  wordy  presentation  of  the  Russell  paradox 
may be given as follows. Let X denote any set. Then, by the definition of N,

$$(X \in N) \leftrightarrow (X \notin X).$$

Now take X to be N, and we have the contradiction

$$(N \in N) \leftrightarrow (N \notin N).$$
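
The compact form of the argument can also be checked formally. The following Lean 4 sketch is our own rendering, not part of the text: `mem x y` is a hypothetical stand-in for "x is a member of y" on an arbitrary type, and the theorem states that no object N can satisfy the defining condition for every X.

```lean
-- mem x y is read "x is a member of y"; the relation is arbitrary.
theorem no_russell_set {α : Type} (mem : α → α → Prop) :
    ¬ ∃ n : α, ∀ x : α, mem x n ↔ ¬ mem x x := by
  intro ⟨n, h⟩
  -- Take X to be N in the defining condition.
  have hn : mem n n ↔ ¬ mem n n := h n
  -- N cannot be a member of itself ...
  have hnot : ¬ mem n n := fun hmem => (hn.mp hmem) hmem
  -- ... but then the condition makes it a member of itself.
  exact hnot (hn.mpr hnot)
```

Nothing about sets is used beyond the membership relation itself, which matches the remark that the paradox depends only on the concept of set.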

This  paradox  was  communicated  by  Russell  to  Frege  just  after  the 
latter  had  completed  the  last  volume  of  his  great  two-volume  treatise  on  the 
foundations  of  arithmetic.  Frege  acknowledged  the  communication  at  the 
end  of  his  treatise  by  the  following  pathetic  and  remarkably  restrained 
sentences.  “A  scientist  can  hardly  meet  with  anything  more  undesirable  than 
to  have  the  foundation  give  way  just  as  the  work  is  finished.  In  this  position 
I  was  put  by  a  letter  from  Mr.  Bertrand  Russell  as  the  work  was  nearly 
through  the  press.”  Thus  terminated  the  labor  of  a  dozen  or  more  years. 

The  Russell  paradox  has  been  popularized  in  many  forms.  One  of  the 
best  known  of  these  forms  was  given  by  Russell  himself  in  1919  and  concerns 
the  plight  of  the  barber  of  a  certain  village  who  has  enunciated  the  principle 
that  he  shaves  all  those  persons  and  only  those  persons  of  the  village  who 
do  not  shave  themselves.  The  paradoxical  nature  of  this  situation  is  realized 
when  we  try  to  answer  the  question,  “Does  the  barber  shave  himself?” 
If  he  does  shave  himself,  then  he  shouldn’t  according  to  his  principle; 
if  he  doesn’t  shave  himself,  then  he  should  according  to  his  principle. 
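
The barber's principle can be tested exhaustively on a small village. The sketch below, with a hypothetical village of three persons, runs through every possible "x shaves y" relation and looks for a villager who shaves exactly those villagers who do not shave themselves.

```python
from itertools import product

def barber_exists(villagers):
    """Search every possible 'x shaves y' relation for a villager b who
    shaves y exactly when y does not shave y."""
    pairs = [(x, y) for x in villagers for y in villagers]
    for bits in product([False, True], repeat=len(pairs)):
        shaves = dict(zip(pairs, bits))
        for b in villagers:
            if all(shaves[(b, y)] == (not shaves[(y, y)]) for y in villagers):
                return True
    return False

print(barber_exists(["a", "b", "c"]))  # False
```

The search fails for the reason given above: taking y to be the barber himself makes the condition demand that he shave himself exactly when he does not.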

Since the discovery of the above contradictions within Cantor’s theory
of sets, additional paradoxes have been produced in abundance. These
modern paradoxes of set theory are related to several ancient paradoxes of
logic. For example, Eubulides, of the fourth century B.C., is credited with
making the remark, “This statement I am now making is false.” If Eubulides’
statement is true, then, by what it says, the statement must be false. On
the other hand, if Eubulides’ statement is false, then it follows that his
statement must be true. Thus Eubulides’ statement can neither be true nor
false without entailing a contradiction. Still older than the Eubulides paradox
may be the unauthenticated Epimenides paradox. Epimenides, who himself
was a Cretan philosopher of the sixth century B.C., is claimed to have made
the remark, “Cretans are always liars.” A simple analysis of this remark
easily  reveals  that  it,  too,  is  self-contradictory. 

The  existence  of  paradoxes  in  set  theory,  like  those  described  above, 
clearly  indicates  that  something  is  wrong.  Since  their  discovery,  a  great 
deal  of  literature  on  the  subject  has  appeared,  and  numerous  attempts  at 
a  solution  have  been  offered. 

So  far  as  mathematics  is  concerned,  there  seems  to  be  an  easy  way  out. 
One  has  merely  to  reconstruct  set  theory  on  an  axiomatic  basis  sufficiently 
restrictive  to  exclude  the  known  antinomies.  The  first  such  attempt  was  made 
by  Zermelo  in  1908,  and  subsequent  refinements  have  been  made  by  Fraenkel 
(1922,  1925),  Skolem  (1922,  1929),  von  Neumann  (1925,  1928),  Bernays 
(1937-1948),  and  others.  But  such  a  procedure  has  been  criticized  as  merely 
avoiding  the  paradoxes;  certainly  it  does  not  explain  them.  Moreover,  this 
procedure  carries  no  guarantee  that  other  kinds  of  paradoxes  will  not  crop 
up  in  the  future. 

There  is  another  procedure  which  apparently  both  explains  and  avoids 
the  known  paradoxes.  If  examined  carefully,  it  will  be  seen  that  each  of  the 
paradoxes  considered  above  involves  a  set  S  and  a  member  m  of  S  whose 
definition depends upon S. Such a definition is said to be impredicative, and
impredicative definitions are, in a sense, circular. Consider, for instance,
Russell’s barber paradox. Let us designate the barber by m and the set of all
members of the barber’s village by S. Then m is defined impredicatively
as “that member of S who shaves all those members and only those members
of S who do not shave themselves.” The circular nature of this definition is
evident: the definition of the barber involves the members of the village
and the barber himself is a member of the village.

Poincaré considered the cause of the antinomies to lie in impredicative
definitions,  and  Russell  expressed  the  same  view  in  his  vicious  circle 
principle:  No  set  S  is  allowed  to  contain  members  m  definable  only  in  terms  of  S, 
or  members  m  involving  or  presupposing  S.  This  principle  amounts  to  a  restriction 
on  the  concept  of  set.  Cantor  had  attempted  to  give  the  concept  of  set  a 
very general meaning by stating: By a set S we are to understand any collection
into a whole of definite and separate objects m of our intuition or our thought; these
objects m are called the elements of S. The theory of sets constructed on Cantor’s
general  concept  of  set  leads,  as  we  have  seen,  to  contradictions,  but  if  the 
notion  of  set  is  restricted  by  the  vicious  circle  principle,  the  resulting  theory 
avoids  the  known  antinomies.  The  outlawing  of  impredicative  definitions 
would  appear,  then,  to  be  a  solution  to  the  known  paradoxes  of  set  theory. 
There  is,  however,  one  serious  objection  to  this  solution;  namely,  there  are 
parts  of  mathematics,  which  mathematicians  are  very  reluctant  to  discard, 
that  contain  impredicative  definitions. 

An  example  of  an  impredicative  definition  in  mathematics  is  that  of 
the least upper bound of a given set of real numbers: the least upper bound
of the given set is the smallest member of the set of all upper bounds of
the given set. There are many similar instances of impredicative definitions
in mathematics, though many of them can be circumvented. In 1918,
Hermann Weyl undertook* to find out how much of analysis can be constructed
genetically from the natural number system without the use of
impredicative definitions. Although he succeeded in obtaining a considerable
part of analysis, he was unable to derive the important theorem that
every  nonempty  set  of  real  numbers  having  an  upper  bound  has  a  least 
upper  bound. 

Other  attempts  to  solve  the  paradoxes  of  set  theory  look  for  the  trouble 
in  logic,  and  it  must  be  admitted  that  the  discovery  of  the  paradoxes  in 
the  unrestricted  theory  of  sets  has  brought  about  a  thorough  investigation 
of  the  foundations  of  logic.  Very  intriguing  is  the  suggestion  that  the  way 
out of the difficulties of the paradoxes may be through the use of a three-valued
logic. For example, in the Russell paradox given above, we saw that
the statement, “N is a member of itself,” can be neither true nor false.
Here  a  third  possibility  would  be  helpful,  and  the  situation  would  be  saved 
if  we  could  simply  classify  the  statement  as  ?. 
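
The suggestion can be made concrete in a small way. The Russell statement is equivalent to its own negation, so any truth value assigned to it must be left unchanged by negation. The sketch below assumes that equivalent statements receive the same truth value and, for the three-valued case, the negation table in which (p')' is the same as p; both are assumptions made for this illustration only.

```python
NEG2 = {"T": "F", "F": "T"}            # two-valued negation
NEG3 = {"T": "F", "?": "?", "F": "T"}  # a three-valued negation with (p')' = p

def consistent_values(neg):
    """Truth values that a statement equivalent to its own negation could take."""
    return [v for v in neg if neg[v] == v]

print(consistent_values(NEG2))  # []
print(consistent_values(NEG3))  # ['?']
```

With two values there is no consistent assignment; with three, the value ? is the only one available.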

There  have  arisen  three  main  philosophies,  or  schools  of  thought, 
concerning the foundations of mathematics: the so-called logistic, intuitionist,
and formalist schools. Naturally, any modern philosophy of the
foundations  of  mathematics  must,  somehow  or  other,  cope  with  the  present 
crisis  in  the  foundations  of  mathematics.  In  the  next  section  we  shall  very 
briefly  consider  these  three  schools  of  thought  and  shall  point  out  how  each 
proposes  to  deal  with  the  antinomies  of  general  set  theory. 

*H. Weyl [1].

9.5  Philosophies  of  Mathematics

A philosophy may be regarded as an explanation which attempts to
make some kind of sense out of the natural disorder of a set of experiences.
From this point of view it is possible to have a philosophy of almost anything:
a philosophy of art, of life, of religion, of education, of society, of history,
of  science,  of  mathematics,  even  of  philosophy  itself.  A  philosophy  amounts 
to  a  process  of  refining  and  ordering  experiences  and  values;  it  seeks  relations 
among  things  which  are  normally  felt  to  be  disparate,  and  finds  important 
differences between things normally confused as the same; it is the description
of a theory concerning the nature of something. In particular, a philosophy
of mathematics essentially amounts to an attempted reconstruction
in which the chaotic mass of mathematical knowledge accumulated over
the ages is given a certain sense or order. Clearly, a philosophy is a function
of time, and a particular philosophy may become outdated or have to be
altered in the light of additional experiences. We are here concerned with
only contemporary philosophies of mathematics: philosophies which take
account  of  the  recent  advances  in  mathematics  and  of  the  current  crisis  in 
the  subject. 

There  are  three  principal  present-day  philosophies  of  mathematics, 
each  of  which  has  attracted  a  sizable  group  of  adherents  and  developed  a 
large  body  of  associated  literature.  These  are  referred  to  as  the  logistic 
school,  of  which  Russell  and  Whitehead  are  the  chief  expositors;  the 
intuitionist  school,  led  by  Brouwer;  and  the  formalist  school,  developed 
principally  by  Hilbert.  There  are,  of  course,  present-day  philosophies  of 
mathematics  other  than  these  three.  There  are  some  independent  philosophies 
and  some  which  constitute  various  mixtures  of  the  principal  three,  but  these 
other  points  of  view  have  not  been  so  widely  cultivated,  or  do  not  comprise 
a  reconstruction  of  mathematics  of  similar  extent. 

In  this  section  we  shall  attempt  a  description  of  each  of  these  three 
principal  philosophies  of  mathematics.  Obviously,  it  will  not  be  possible 
in  a  work  of  the  size  and  nature  of  the  present  book  to  do  proper  justice 
to  this  subject,  but  it  is  hoped  that  the  treatment  will  be  sufficient  to  give 
some  idea  of  these  current  schools  of  thought  which  are  so  intimately 
connected  with  the  foundations  and  fundamental  concepts  of  mathematics. 

Logicism 

The  logistic  thesis  is  that  mathematics  is  a  branch  of  logic.  Rather  than 
being  just  a  tool  of  mathematics,  logic  becomes  the  progenitor  of  mathematics.
All  mathematical  concepts  are  to  be  formulated  in  terms  of  logical
concepts,  and  all  theorems  of  mathematics  are  to  be  developed  as  theorems 
of  logic;  the  distinction  between  mathematics  and  logic  becomes  merely  one 
of  practical  convenience. 

The  notion  of  logic  as  a  science  containing  the  principles  and  ideas 
underlying  all  other  sciences  dates  back  at  least  as  far  as  Leibniz  (1666). 
The  actual  reduction  of  mathematical  concepts  to  logical  concepts  was 
engaged  in  by  Dedekind  (1888)  and  Frege  (1884-1903),  and  the  statement  of 
mathematical  theorems  by  means  of  a  logical  symbolism  was  undertaken 
by  Peano  (1889-1908).  These  men,  then,  are  forerunners  of  the  logistic
school,  which  received  its  definitive  expression  in  the  monumental  Principia 
mathematica  of  Whitehead  and  Russell  (1910-1913).  This  great  and  complex 
work  purports  to  be  a  detailed  reduction  of  the  whole  of  mathematics  to 
logic.  Subsequent  modifications  and  refinements  of  the  program  have  been 
supplied  by  Wittgenstein  (1922),  Chwistek  (1924-1925),  Ramsey  (1926), 
Langford  (1927),  Carnap  (1931),  Quine  (1940),  and  others. 

The  logistic  thesis  arises  naturally  from  the  effort  to  push  down  the 
foundations  of  mathematics  to  as  deep  a  level  as  possible.  We  have  seen 
how  these  foundations  were  established  in  the  real  number  system,  and  then 
how  they  were  pushed  back  from  the  real  number  system  to  the  natural 
number  system,  and  thence  into  set  theory.  Since  the  theory  of  classes  is 
an  essential  part  of  logic,  the  idea  of  reducing  mathematics  to  logic  certainly 
suggests  itself.  The  logistic  thesis  is  thus  an  attempted  synthesization 
suggested  by  an  important  trend  in  the  history  of  the  application  of  the 
postulational  method. 

The  Principia  mathematica  starts  with  “primitive  ideas”  and  “primitive 
propositions,”  corresponding  to  the  “undefined  terms”  and  “postulates” 
of  a  formal  abstract  development.  These  primitive  ideas  and  propositions 
are  not  to  be  subjected  to  interpretation  but  are  restricted  to  intuitive  concepts
of  logic;  they  are  to  be  regarded  as,  or  at  least  are  to  be  accepted  as,
plausible  descriptions  and  hypotheses  concerning  the  real  world.  In  short, 
a  concrete  rather  than  an  abstract  point  of  view  prevails,  and  consequently 
no  attempt  is  made  to  prove  the  consistency  of  the  primitive  propositions. 
The  aim  of  Principia  mathematica  is  to  develop  mathematical  concepts  and 
theorems  from  these  primitive  ideas  and  propositions,  starting  with  a  calculus 
of  propositions,  proceeding  up  through  the  theory  of  classes  and  relations 
to  the  establishment  of  the  natural  number  system,  and  thence  to  all  mathematics
derivable  from  the  natural  number  system.  In  this  development
the  natural  numbers  emerge  with  the  unique  meanings  which  we  ordinarily 
assign  to  them  and  are  not  nonuniquely  defined  as  any  things  which  satisfy 
a  certain  set  of  abstract  postulates. 

To  avoid  the  contradictions  of  set  theory,  Principia  mathematica  employs 
a  “theory  of  types.”  Somewhat  oversimply  described,  such  a  theory  sets  up 
a  hierarchy  of  levels  of  elements.  The  primary  elements  constitute  those  of 
type  0;  classes  of  elements  of  type  0  constitute  those  of  type  1;  classes  of 
elements  of  type  1  constitute  those  of  type  2;  and  so  on.  In  applying  the 
theory  of  types,  one  follows  the  rule  that  all  the  elements  of  any  class  must 
be  of  the  same  type.  Adherence  to  this  rule  precludes  impredicative  definitions
and  thus  avoids  the  paradoxes  of  set  theory.  As  originally  presented  in
Principia  mathematica,  hierarchies  within  hierarchies  appeared,  leading  to  the
so-called  “ramified”  theory  of  types.  In  order  to  obtain  the  impredicative
definitions  needed  to  establish  analysis,  an  “axiom  of  reducibility”  had  to 
be  introduced.  The  nonprimitive  and  arbitrary  character  of  this  axiom  drew 
forth  severe  criticism,  and  much  of  the  subsequent  refinement  of  the  logistic 
program  lies  in  attempts  to  devise  some  method  of  avoiding  the  disliked 
axiom  of  reducibility. 
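
The simple hierarchy of levels described above can be illustrated with a small Python sketch. It models only the rule that every member of a class must lie exactly one level below the class; it does not attempt the ramified theory or the axiom of reducibility. The names Individual, TypedClass, and contains are hypothetical and chosen only for this illustration.

```python
class Individual:
    """A primary element; by convention it has type 0."""
    level = 0

    def __init__(self, name):
        self.name = name


class TypedClass:
    """A class of type `level`; every member must be of type `level - 1`."""

    def __init__(self, level, members=()):
        if level < 1:
            raise ValueError("classes begin at type 1")
        members = tuple(members)
        for m in members:
            if m.level != level - 1:
                raise TypeError(
                    f"a class of type {level} cannot contain an element of type {m.level}"
                )
        self.level = level
        self.members = members

    def contains(self, x):
        """Answer 'is x a member?' only when the question is well-typed."""
        if x.level != self.level - 1:
            raise TypeError("membership is only defined one level down")
        return any(x is m for m in self.members)


a, b = Individual("a"), Individual("b")
c1 = TypedClass(1, [a, b])      # a class of individuals
c2 = TypedClass(2, [c1])        # a class of classes of individuals

print(c1.contains(a))           # True
print(c2.contains(c1))          # True
try:
    c1.contains(c1)             # "is c1 a member of itself?" is ill-typed
except TypeError as err:
    print("rejected:", err)
```

In this sketch the question "is this class a member of itself?" is not answered true or false; it is rejected as ill-formed, which is how a typed hierarchy keeps the Russell class from being formed at all.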

Whether  or  not  the  logistic  thesis  has  been  established  seems  to  be  a 
matter  of  opinion.  Although  some  accept  the  program  as  satisfactory,  others 
have  found  many  objections  to  it.  For  one  thing,  the  logistic  thesis  can  be 
questioned  on  the  ground  that  the  systematic  development  of  logic  (as  of 
any  organized  study)  presupposes  mathematical  ideas  in  its  formulation, 
such  as  the  fundamental  idea  of  iteration  which  must  be  used,  for  example, 
in  describing  the  theory  of  types  or  the  idea  of  deduction  from  given 
premises. 


Intuitionism 

The  intuitionist  thesis  is  that  mathematics  is  to  be  built  solely  by  finite 
constructive  methods  on  the  intuitively  given  sequence  of  natural  numbers. 
According  to  this  view,  then,  at  the  very  base  of  mathematics  lies  a  primitive 
intuition,  allied,  no  doubt,  to  our  temporal  sense  of  before  and  after,  which 
allows  us  to  conceive  a  single  object,  then  one  more,  then  one  more,  and 
so  on  endlessly.  In  this  way  we  obtain  unending  sequences,  the  best  known 
of  which  is  the  sequence  of  natural  numbers.  From  this  intuitive  base  of 
the  sequence  of  natural  numbers,  any  other  mathematical  object  must  be 
built  in  a  purely  constructive  manner,  employing  a  finite  number  of  steps 
or  operations.  In  the  intuitionist  thesis  we  have  the  genetical  development 
of  mathematics  pushed  to  its  extreme. 

The  intuitionist  school  (as  a  school)  originated  about  1908  with  the 
Dutch  mathematician  L.  E.  J.  Brouwer,  although  one  finds  some  of  the 
intuitionist  ideas  uttered  earlier  by  such  men  as  Kronecker  (in  the  1880’s) 
and  Poincaré  (1902-1906).  The  school  has  gradually  strengthened  with  the
passage  of  time,  has  won  over  some  eminent  present-day  mathematicians, 
and  has  exerted  a  tremendous  influence  on  all  thinking  concerning  the 
foundations  of  mathematics. 

Some  of  the  consequences  of  the  intuitionist  thesis  are  little  short  of 
revolutionary.  Thus  the  insistence  on  constructive  methods  leads  to  a 
conception  of  mathematical  existence  not  shared  by  all  practicing  mathematicians.
For  the  intuitionists,  an  entity  whose  existence  is  to  be  proved
must  be  shown  to  be  constructible  in  a  finite  number  of  steps;  it  is  not 
sufficient  to  show  that  the  assumption  of  the  entity’s  nonexistence  leads  to  a 
contradiction.  This  means  that  many  existence  proofs  found  in  current 
mathematics  are  not  acceptable  to  the  intuitionists. 

An  important  instance  of  the  intuitionists’  insistence  upon  constructive 
procedures  is  in  the  theory  of  sets.  For  the  intuitionists,  a  set  cannot  be
thought  of  as  a  ready-made  collection,  but  must  be  considered  as  a  law  by 
means  of  which  the  elements  of  the  set  can  be  constructed  in  a  step-by-step 
fashion.  This  concept  of  set  rules  out  the  possibility  of  such  contradictory 
sets  as  “the  set  of  all  sets.” 

There  is  another  remarkable  consequence  of  the  intuitionists’  insistence 
upon  finite  constructibility,  and  this  is  the  denial  of  the  universal  acceptance 
of  the  law  of  excluded  middle!  Consider,  for  example,  the  number  x,  which
is  defined  to  be  (-1)^k,  where  k  is  the  number  of  the  first  decimal  place  in  the
decimal  expansion  of  π  where  the  sequence  of  consecutive  digits  123456789
begins,  and,  if  no  such  k  exists,  x  =  0.  Now,  although  the  number  x  is
well-defined,  we  cannot  at  the  moment,  under  the  intuitionists’  restrictions, 
say  that  the  proposition  “x  =  0”  is  either  true  or  false.  This  proposition 
can  be  said  to  be  true  only  when  a  proof  of  it  has  been  constructed  in  a 
finite  number  of  steps,  and  it  can  be  said  to  be  false  only  when  a  proof  of 
this  situation  has  been  constructed  in  a  finite  number  of  steps.  Until  one  or 
the  other  of  these  proofs  is  constructed,  the  proposition  is  neither  true  nor 
false,  and  the  law  of  excluded  middle  is  inapplicable.  If,  however,  k  is  further 
restricted  to  be  less  than  a  billion,  say,  then  it  is  perfectly  correct  to  say 
that  the  proposition  is  now  either  true  or  false,  for,  with  k  less  than  a  billion, 
the  truth  or  falseness  can  certainly  be  established  in  a  finite  number  of  steps. 
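
The bounded version of the question is a finite search, which can be sketched in Python as follows. The sketch computes decimal places of π with Machin's formula in integer arithmetic and then looks for a digit pattern within a stated bound. The function names, the ten guard digits, the short pattern "159", and the small bounds are all assumptions made for illustration; a bound of a billion would follow the same logic but would call for a far more efficient digit algorithm.

```python
def arctan_inverse(x, unity):
    """Return arctan(1/x) * unity, truncated, via the alternating Taylor series."""
    term = unity // x
    total = term
    x_squared = x * x
    divisor, sign = 3, -1
    while term:
        term //= x_squared
        total += sign * (term // divisor)
        divisor += 2
        sign = -sign
    return total


def pi_decimals(n):
    """Return the first n decimal places of pi as a string (Machin's formula)."""
    guard = 10  # extra working digits that absorb truncation error
    unity = 10 ** (n + guard)
    pi_scaled = 16 * arctan_inverse(5, unity) - 4 * arctan_inverse(239, unity)
    return str(pi_scaled // 10 ** guard)[1:]  # drop the leading "3"


def first_place(pattern, bound):
    """Return the first decimal place k <= bound where pattern begins, else None."""
    digits = pi_decimals(bound + len(pattern) - 1)
    index = digits.find(pattern)
    return index + 1 if index >= 0 else None


def x_bounded(pattern, bound):
    """Compute x = (-1)^k when k is found within the bound, and x = 0 otherwise."""
    k = first_place(pattern, bound)
    return 0 if k is None else (-1) ** k


print(pi_decimals(10))               # 1415926535
print(first_place("159", 100))       # 3
print(x_bounded("159", 100))         # -1
```

Because the search always halts after examining a fixed number of digits, the bounded proposition "x = 0" is settled one way or the other in a finite number of steps, which is the situation in which the intuitionist accepts the law of excluded middle.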

Thus,  for  the  intuitionists,  the  law  of  excluded  middle  holds  for  finite 
sets  but  should  not  be  employed  when  dealing  with  infinite  sets.  This  state 
of  affairs  is  blamed  by  Brouwer  on  the  sociological  development  of  logic. 
The  laws  of  logic  emerged  at  a  time  in  man’s  evolution  when  he  had  a  good 
language  for  dealing  with  finite  sets  of  phenomena;  he  then  later  made  the 
mistake  of  applying  these  laws  to  the  infinite  sets  of  mathematics,  with  the 
result  that  antinomies  arose. 

In  the  Principia  mathematica,  the  law  of  excluded  middle  and  the  law  of
contradiction  are  equivalent.  For  the  intuitionists,  this  situation  no  longer 
prevails,  and  it  is  an  interesting  problem  to  try,  if  possible,  to  set  up  the 
logical  apparatus  to  which  intuitionist  ideas  lead  us.  This  was  done  in  1930 
by  A.  Heyting,  who  succeeded  in  developing  an  intuitionist  symbolic  logic. 
Intuitionist  mathematics  thus  produces  its  own  type  of  logic,  and  mathematical
logic,  as  a  consequence,  is  a  branch  of  mathematics.  In  the  intuitionist
program,  mathematical  logic  is  almost  irrelevant;  it  is  subordinate
and  merely  useful  in  expression  or  communication. 
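
The separation of the two laws can be illustrated in a modern proof assistant. The following Lean 4 sketch is offered only as an illustration: Lean's constructive core is related to, but not the same system as, Heyting's 1930 calculus, and the theorem names are hypothetical. The law of contradiction and the double negation of excluded middle are proved here without any classical axiom, whereas excluded middle itself is obtained by appeal to the classical principle `Classical.em`.

```lean
-- Law of contradiction: provable constructively.
theorem no_contradiction (p : Prop) : ¬(p ∧ ¬p) :=
  fun h => h.2 h.1

-- The double negation of excluded middle: also provable constructively.
theorem not_not_em (p : Prop) : ¬¬(p ∨ ¬p) :=
  fun h => h (Or.inr (fun hp => h (Or.inl hp)))

-- Excluded middle itself: obtained here from the classical axiom.
theorem em_classical (p : Prop) : p ∨ ¬p :=
  Classical.em p
```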

There  is  the  final  important  question:  How  much  of  existing  mathematics
can  be  built  within  the  intuitionist  restrictions?  If  all  of  it  can  be  so
rebuilt,  without  too  great  an  increase  in  the  labor  required,  then  the  present 
problem  of  the  foundations  of  mathematics  would  appear  to  be  solved.  Now 
the  intuitionists  have  succeeded  in  rebuilding  large  parts  of  present-day 
mathematics,  including  a  theory  of  the  continuum  and  a  set  theory,  but 
there  is  a  great  deal  that  is  still  wanting.  So  far,  intuitionist  mathematics  has
turned  out  to  be  considerably  less  powerful  than  classical  mathematics,  and 
in  many  ways  it  is  much  more  complicated  to  develop.  This  is  the  fault  found 
with  the  intuitionist  approach— too  much  that  is  dear  to  most  mathematicians 
is  sacrificed.  This  situation  may  not  exist  forever,  because  there  remains  the 
possibility  of  an  intuitionist  reconstruction  of  classical  mathematics  carried 
out  in  a  different  and  more  successful  way.  And  meanwhile,  in  spite  of  present 
objections  raised  against  the  intuitionist  thesis,  it  is  generally  conceded  that 
its  methods  do  not  lead  to  contradictions. 

Formalism 

The  formalist  thesis  is  that  mathematics  is  concerned  with  formal 
symbolic  systems.  In  fact,  mathematics  is  regarded  as  a  collection  of  such 
abstract  developments,  in  which  the  terms  are  mere  symbols  and  the  statements
are  formulas  involving  these  symbols;  the  ultimate  base  of  mathematics
does  not  lie  in  logic  but  only  in  a  collection  of  prelogical  marks  or  symbols 
and  in  a  set  of  operations  with  these  marks.  Since,  from  this  point  of  view, 
mathematics  is  devoid  of  concrete  content  and  contains  only  ideal  symbolic 
elements,  the  establishment  of  the  consistency  of  the  various  branches  of 
mathematics  becomes  an  important  and  necessary  part  of  the  formalist 
program.  Without  such  an  accompanying  consistency  proof,  the  whole  study 
is  essentially  senseless.  In  the  formalist  thesis  we  have  the  axiomatic  develop¬ 
ment  of  mathematics  pushed  to  its  extreme. 

The  formalist  school  was  founded  by  David  Hilbert  after  completing 
his  postulational  study  of  geometry.  In  his  Grundlagen  der  Geometrie  (1899), 
Hilbert  had  sharpened  the  postulational  method  from  the  material  axiomatics
of  Euclid  to  the  formal  axiomatics  of  the  present  day.  The  formalist
point  of  view  was  developed  later  by  Hilbert  to  meet  the  crisis  caused  by 
the  paradoxes  of  set  theory  and  the  challenge  to  classical  mathematics 
caused  by  intuitionistic  criticism.  Although  Hilbert  talked  in  formalistic 
terms  as  early  as  1904,  not  until  after  1920  did  he  and  his  collaborators, 
Bernays,  Ackermann,  von  Neumann,  and  others,  seriously  start  work  on 
what  is  now  known  as  the  formalist  program. 

The  success  or  failure  of  Hilbert’s  program  to  save  classical  mathematics 
hinges  upon  the  solution  of  the  consistency  problem.  Freedom  from  contradiction
is  guaranteed  only  by  consistency  proofs,  and  the  older  consistency
proofs  based  upon  interpretations  and  models  usually  merely  shift  the 
question  of  consistency  from  one  domain  of  mathematics  to  another.  In 
other  words,  a  consistency  proof  by  the  method  of  models  is  only  relative. 
Hilbert,  therefore,  conceived  a  new  direct  approach  to  the  consistency 
problem.  Much  as  one  may  prove,  by  the  rules  of  a  game,  that  certain 
situations  cannot  occur  within  the  game,  Hilbert  hoped  to  prove,  by  a  suitable 
set  of  rules  of  procedure  for  obtaining  acceptable  formulas  from  the  basic 
symbols,  that  a  contradictory  formula  can  never  occur.  In  the  logical  notation
introduced  earlier  in  this  chapter,  a  contradictory  formula  is  any  formula
of  the  type  F  ∧  F',  where  F  is  some  accepted  formula  of  the  system.  If  one
can  show  that  no  such  contradictory  formula  is  possible,  then  one  has 
established  the  consistency  of  the  system. 
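
For the calculus of propositions, the fact that a formula of the type F ∧ F' can never be a tautology is easy to check by truth tables. The Python sketch below does this by brute force over all truth assignments. It is a semantic check only, not Hilbert's proof-theoretic method, and the helper names (implies, truth_table, is_tautology, is_absurdity) are hypothetical.

```python
from itertools import product


def implies(p, q):
    """Material implication: false only when p is true and q is false."""
    return (not p) or q


def truth_table(formula, n):
    """List (assignment, value) for all 2**n assignments to n primary propositions."""
    return [(values, bool(formula(*values)))
            for values in product([True, False], repeat=n)]


def is_tautology(formula, n):
    """True when the formula is true under every assignment."""
    return all(value for _, value in truth_table(formula, n))


def is_absurdity(formula, n):
    """True when the formula is false under every assignment."""
    return not any(value for _, value in truth_table(formula, n))


print(len(truth_table(lambda p, q, r: p and q and r, 3)))                 # 8
print(is_tautology(lambda p, q: implies(p and q, p), 2))                  # True
print(is_tautology(lambda p, q: implies(p or q, p), 2))                   # False
print(is_absurdity(lambda p, q: implies(p, q) and not implies(p, q), 2))  # True
```

Since F ∧ F' is false on every line of its truth table whatever F may be, a system all of whose derivable formulas are tautologies cannot yield a contradictory formula.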

The  development  of  the  above  ideas  of  a  direct  test  for  consistency  in 
mathematics  is  called,  by  Hilbert,  the  “proof  theory.”  Hilbert  and  Bernays 
planned  to  give  a  detailed  exposition  (and  application  to  all  classical  mathematics)
of  the  proof  theory  in  their  great  Grundlagen  der  Mathematik,  which
may  be  considered  as  the  “Principia  mathematica”  of  the  formalist  school. 
The  Grundlagen  der  Mathematik  was  finally  published  in  two  volumes,  Volume 
I  in  1934  and  Volume  II  in  1939,  but,  as  the  work  was  being  written, 
unforeseen  difficulties  arose,  and  it  was  not  possible  to  complete  the  proof 
theory.  For  certain  elementary  systems,  proofs  of  consistency  were  carried 
out,  which  illustrated  what  Hilbert  would  like  to  have  done  for  all  classical 
mathematics,  but,  for  the  system  in  toto,  the  problem  of  consistency  remained
refractory. 

As  a  matter  of  fact,  the  Hilbert  program,  at  least  in  the  form  originally 
envisioned  by  Hilbert,  appears  to  be  doomed  to  failure;  this  truth  was 
brought  out  by  Kurt  Gödel  in  1931,  actually  before  the  publication  of  the
Grundlagen  had  taken  place.  Gödel  showed,  by  unimpeachable  methods
acceptable  to  the  followers  of  any  one  of  the  three  principal  schools  of  the
philosophy  of  mathematics,  that  it  is  impossible  for  a  sufficiently  rich  formalized
deductive  system,  such  as  Hilbert’s  system  for  all  classical  mathematics,
to  prove  consistency  of  the  system  by  methods  belonging  to  the  system.  This
remarkable  result  is  a  consequence  of  an  even  more  fundamental  one:
Gödel  proved  the  incompleteness  of  Hilbert’s  system,  that  is,  he  established
the  existence  within  the  system  of  “undecidable”  problems,  of  which  consistency
of  the  system  is  one.  These  theorems  of  Gödel  are  too  difficult  to
consider  in  their  technical  details  here.  They  are  certainly  among  the  most 
remarkable  in  all  mathematics,  and  they  reveal  an  unforeseen  limitation 
in  the  methods  of  formal  mathematics.  They  show  “that  the  formal  systems 
known  to  be  adequate  for  the  derivation  of  mathematics  are  unsafe  in  the 
sense  that  their  consistency  cannot  be  demonstrated  by  finitary  methods 
formalized  within  the  system,  whereas  any  system  known  to  be  safe  in  this 
sense  is  inadequate.”* 

*F.  De  Sua.  Here  we  also  find  the  following  interesting  remark:  “Suppose  we  loosely 
define  a  religion  as  any  discipline  whose  foundations  rest  on  an  element  of  faith, 
irrespective  of  any  element  of  reason  which  may  be  present.  Quantum  mechanics 
for  example  would  be  a  religion  under  this  definition.  But  mathematics  would  hold 
the  unique  position  of  being  the  only  branch  of  theology  possessing  a  rigorous  demonstration
of  the  fact  that  it  should  be  so  classified.”  See  also  the  Appendix,  Section  A.7.


PROBLEMS

9.1.1  By  the  use  of  truth  tables  establish  the  laws  of  logic  tabulated  in 
Section  9.1. 

9.1.2  By  the  use  of  truth  tables  determine  whether  or  not  the  following 
propositions  are  tautologies: 

(a)  (p  ∧  q)  →  p.

(b)  (p  ∨  q)  →  p.

(c)  (p'  →  p)  ↔  p.

(d)  (p  ∧  q)  ↔  (q  ∧  p).

(e)  [p  →  (q  ∧  r)]  ↔  [(p  →  q)  ∧  (p  →  r)].

(f)  [p  ∧  (q  ∨  r)]  ↔  [(p  ∧  q)  ∨  (p  ∧  r)].

(g)  [(p  ∧  q')  →  p']  ↔  (p  →  q).

(h)  [(p  →  q)  →  r]  →  [p  →  (q  →  r)].

9.1.3  Let  P  denote  a  composite  proposition  involving  n  primary  propositions.
Show  that  the  truth  table  for  P  contains  2^n  lines.

9.1.4  Establish  the  logical  equivalence  of  the  pairs  of  propositions  tabulated 
at  the  end  of  Section  9.1. 

9.1.5  Express  the  following  propositions  in  terms  of  disjunction  and  negation
symbols  only:

(a)  (p  ∨  p)  →  p.

(b)  q  →  (p  ∨  q).

(c)  (p  ∨  q)  →  (q  ∨  p).

(d)  (q  →  r)  →  [(p  ∨  q)  →  (p  ∨  r)].

(e)  [p  ∨  (q  ∨  r)]  →  [q  ∨  (p  ∨  r)].

9.1.6  Let  us  designate  or  when  used  in  the  exclusive  sense  by  ⊻,  so  that
“p  ⊻  q”  means  “p  or  q  but  not  both.”  Show  that  we  may  express  p  ⊻  q  by

(a)  (p  ∨  q)  ∧  (p  ∧  q)'.

(b)  (p  ∧  q')  ∨  (p'  ∧  q).

9.1.7  By  the  use  of  truth  tables  show  that  the  following  pairs  of  propositions 
are  logically  equivalent: 

(a)  (p  ∧  q)'  and  p'  ∨  q'.

(b)  (p  ∨  q)'  and  p'  ∧  q'.

(c)  (p  →  q)'  and  p  ∧  q'.

(d)  (p  ↔  q)'  and  p  ↔  q'.

(e)  (p  ↔  q)'  and  p'  ↔  q.

9.1.8  Form  the  negation  of  each  of  the  following  propositions,  reducing  the 
result  in  each  case  to  one  in  which  the  negation  sign  is  applied  only  to  the 
propositions  p,  q,  r:

(a)  p'  →  q.

(b)  (p  →  q)  ∧  r.

(c)  (p  ∧  q)  ∨  r.

(d)  p  →  q'.

(e)  p'  ↔  (p  ∧  q').

(f)  (p  ∨  q)  ∧  p.

9.1.9  Related  to  the  proposition  “p  →  q”  are  the  following  three  propositions:
(1)  the  converse,  q  →  p,  (2)  the  inverse,  p'  →  q',  (3)  the  contrapositive,
q'  →  p'.  Show  that

(a)  the  converse  of  a  true  implication  is  not  always  true, 

(b)  the  inverse  of  a  true  implication  is  not  always  true, 

(c)  the  contrapositive  of  a  true  implication  is  always  true, 

(d)  the  contrapositive  of  an  implication  is  the  converse  of  the  inverse 
of  the  implication. 

(e)  Is  the  inverse  of  the  converse  of  an  implication  the  same  as  the 
converse  of  the  inverse  of  the  implication? 

9.1.10  Write  the  converse,  inverse,  and  contrapositive  of  each  of  the 
following  implications: 

(a)  q  →  p,  (b)  p'  →  q',  (c)  q'  →  p',  (d)  p  →  q',  (e)  p'  →  q.

9.1.11  It  is  common  in  mathematics  to  state  theorems  in  forms  like  (1)  “a 
necessary  condition  that  p  be  true  is  that  q  be  true,”  (2)  “a  sufficient  condition
that  p  be  true  is  that  q  be  true,”  (3)  “a  necessary  and  sufficient
condition  that  p  be  true  is  that  q  be  true.”  By  definition,  these  three  forms
mean,  respectively,  (1)  “p  →  q,”  (2)  “q  →  p,”  (3)  “(p  →  q)  ∧  (q  →  p).”

(a)  Show  that  (3)  is  equivalent  to  “p  ↔  q.”

(b)  Show  that  in  order  to  establish  a  necessary  and  sufficient  condition 
that  p  be  true  one  must  establish  both  a  theorem  and  its  converse. 

9.1.12  If  the  last  column  of  the  truth  table  for  a  composite  proposition  m 
contains  only  F’s,  then  m  is  called  an  absurdity;  if  the  last  column  contains
both  T’s  and  F’s,  then  m  is  called  a  contingency.

(a)  Show  that  p  ∧  p'  is  an  absurdity.

(b)  Show  that  [(p  ∧  q)  →  q]'  is  an  absurdity.

(c)  Show  that  if  m  is  an  absurdity,  then  m'  is  a  tautology.

(d)  Show  that  p'  ∧  q'  ∧  r'  is  a  contingency.

(e)  Show  that  p  ∨  q  ∨  r  is  a  contingency.

(f)  Show  that  if  m  is  a  contingency,  then  m'  is  a  contingency.

9.1.13  Show  that  negation  is  indispensable  for  expressing  combinations  of 
propositions. 

9.1.14  Take  p/q  to  mean  p'  ∨  q'.

(a)  Show  the  logical  equivalence  of  p/p  and  p'.

(b)  Show  the  logical  equivalence  of  (p/p)/(q/q)  and  p  ∨  q.

(c)  Express  ∧  in  terms  of  /.

(d)  Express  →  in  terms  of  /.

(e)  Express  ↔  in  terms  of  /.

9.2.1  Establish  the  following  additional  useful  rules  for  obtaining  new 
tautologies  from  given  tautologies  in  the  calculus  of  propositions: 

(a)  R5:  If  m  ∨  m  is  a  tautology,  then  m  is  a  tautology.

(b)  R6:  If  m  is  a  tautology  and  p  is  any  proposition,  then  p  ∨  m  is  a  tautology.

(c)  R7:  If  m  ∨  n  is  a  tautology,  then  n  ∨  m  is  a  tautology.

(d)  R8:  If  m  →  n  is  a  tautology  and  p  is  any  proposition,  then  (p  ∨  m)  →
(p  ∨  n)  is  a  tautology.

(e)  R9:  If  p  →  q  and  q  →  r  are  tautologies,  then  p  →  r  is  a  tautology.

9.2.2  (a)  Show,  from  the  truth  table  for  implication,  that  “if  p  is  true  and
if  p  →  q,  then  q  is  true.”

(b)  Because  of  (a),  Postulate  L1  asserts  that  “if  p  or  p  is  true,  then  p
is  true.”  Give  similar  interpretations  of  Postulates  L2,  L3,  L4  and  of
Theorems  1,  2,  and  3.

9.2.3  Show  that  if,  by  definition,  p  means  q,  then  p  ↔  q  is  a  tautology.

9.2.4  The  following  rule  may  be  proved:  R10:  If  p  ↔  q  is  a  tautology,  then
in  any  tautology  involving  proposition  p  we  may  replace  p  in  any  of  its  occurrences  by  q
and  thus  obtain  another  tautology.  Using  this  rule,  prove

(a)  Theorem  15. 

(b)  Theorem  23. 

9.2.5  (a)  Establish  the  tautology:  (p  →  q')  →  (q  →  p').

(b)  Establish  Theorem  16.

(c)  If  m  ↔  n  is  a  tautology,  show  that  m'  ↔  n'  is  a  tautology.

9.2.6  (a)  Establish  Theorem  21.

(b)  Establish  the  tautology  p  →  (p  ∨  q).

(c)  Establish  Theorem  22. 

9.2.7  Verify,  by  truth  tables,  that  the  postulates  of  Boolean  algebra  become 
tautologies  under  the  interpretations  made  at  the  end  of  Section  9.2. 

9.2.8  Convert  the  following  relations  in  Boolean  algebra  into  tautologies 
in  the  calculus  of  propositions: 

(a)  a  ∩  (b  ∩  c)  =  (a  ∩  b)  ∩  c.

(b)  a  ∩  b  =  b  ∩  a.

(c)  a  ∩  a  =  a.

(d)  a  ∩  (a  ∪  b)  =  a.

9.2.9  (a)  What  is  the  counterpart  in  Boolean  algebra  of  the  implication 
relation  a  →  b  of  the  calculus  of  propositions?

(b)  What  is  the  counterpart  in  the  calculus  of  propositions  of  the
inclusion  relation  a  ⊂  b  of  Boolean  algebra?

9.2.10  From  each  of  the  following  tautologies  find  another  tautology  by 
the  principle  of  duality: 

(a)  [(p  ∧  p')  ∨  q]  ↔  q.

(b)  (p  ∨  q)  ↔  (p'  ∧  q')'.

(c)  The  answers  to  Problem  9.2.8. 

9.2.11  Establish  the  absolute  consistency  of  the  calculus  of  propositions  by 
showing  that  it  is  impossible  to  obtain  from  the  primitive  tautologies  L1,  L2,
L3,  L4,  by  the  rules  R1,  R2,  R3,  two  contradictory  tautologies  s  and  s'.

9.3.1  Using  the  definition,  p  ∨  q  means  (p'  ∧  q')',  construct  truth  tables
for  disjunction  in  a  three-valued  logic  using  the  truth  tables  of 

(a)  Figures  43a  and  44a; 

(b)  Figures  43a  and  44b; 

(c)  Figures  43b  and  44a; 

(d)  Figures  43b  and  44b. 

9.3.2  Using  the  definition,  p  →  q  means  (p  ∧  q')',  construct  truth  tables
for  implication  in  a  three-valued  logic  using  the  truth  tables  of 

(a)  Figures  43a  and  44a; 

(b)  Figures  43a  and  44b; 

(c)  Figures  43b  and  44a; 

(d)  Figures  43b  and  44b. 

9.3.3  Verify  that  there  are  12  possible  truth  tables  for  negation  in  a  three-valued
logic.

9.3.4  (a)  Show  that  for  the  truth  table  in  Figure  44a,  [(p')']'  is  the  same  as  p.
(b)  Show  that  for  the  truth  table  in  Figure  44b,  (p')'  is  the  same  as  p.

9.3.5  How  many  possible  m-valued  logics  are  there  analogous  to  the  3072 
possible  three-valued  logics  indicated  in  Section  9.3? 

9.3.6  Suppose  we  wish  to  define  the  implication  relation,  p  →  q,  subject  to
the  restriction  that  when  p  and  p  →  q  are  both  true,  then  q  is  also  true.
How  many  such  implication  relations  are  possible  in  a  (a)  two-valued  logic? 
(b)  three-valued  logic? 

9.4.1  Consider  the  following  popularizations  of  the  Russell  paradox: 

(a)  Every  municipality  of  a  certain  country  must  have  a  mayor,  and 
no  two  municipalities  may  have  the  same  mayor.  Some  mayors  do  not  reside 
in  the  municipalities  they  govern.  A  law  is  passed  compelling  nonresident 
mayors  to  reside  by  themselves  in  a  certain  special  area  A.  There  are  so 
many  nonresident  mayors  that  A  is  proclaimed  a  municipality.  Where  shall 
the  mayor  of  A  reside? 

(b)  An  adjective  in  the  English  language  is  said  to  be  autological  if  it 
applies  to  itself;  otherwise  the  adjective  is  said  to  be  heterological.  Thus  the 
adjectives  “short,”  “English,”  and  “polysyllabic”  all  apply  to  themselves 
and  hence  are  autological,  whereas  the  adjectives  “long,”  “French,”  and 
“monosyllabic”  do  not  apply  to  themselves  and  hence  are  heterological. 
Now  is  the  adjective  “heterological”  autological  or  heterological? 

(c)  Suppose  a  librarian  compiles,  for  inclusion  in  his  library,  a  bibliography
of  all  those  bibliographies  in  his  library  which  do  not  list  themselves.

9.4.2  Examine  the  following  paradox.  Every  natural  number  can  be 
expressed  in  simple  English,  without  the  use  of  numerical  symbols.  Thus  5 
can  be  expressed  as  “five,”  or  as  “half  of  ten,”  or  as  “the  second  odd  prime,”
or  as  “the  positive  square  root  of  twenty-five,”  and  so  on.  Now  consider
the  expression,  “the  least  natural  number  not  expressible  in  fewer  than 
twenty-three  syllables.”  This  expression  expresses  in  twenty-two  syllables  a 
natural  number  which  cannot  be  expressed  in  fewer  than  twenty-three 
syllables. 

9.4.3  Consider  the  following  logical  dilemmas: 

(a)  A  crocodile,  which  has  stolen  a  child,  promises  the  child’s  father
to  return  the  child  provided  the  father  guesses  whether  the  child  will  be
returned  or  not.  What  should  the  crocodile  do  if  the  father  guesses  that  the
child  will  not  be  returned?

(b)  An  explorer  has  been  captured  by  cannibals  who  offer  the  explorer
the  opportunity  to  make  a  statement  under  the  condition  that  if  it  is  true
he  will  be  boiled  and  if  it  is  false  he  will  be  roasted.  What  should  the  cannibals
do  if  the  explorer  states,  “I  will  be  roasted”?

9.4.4  Show  that  the  statement,  “Every  general  statement  has  its  exceptions,”
is  self-contradictory.

9.4.5  Consider  the  following  questions: 

(a)  What  would  happen  if  an  irresistible  force  should  collide  with  an 
immovable  body? 

(b)  If  Zeus  can  do  anything,  can  he  make  a  stone  that  he  cannot  lift? 

9.4.6  Assuming  an  unrestricted  universe,  show  that  the  set  M  of  all  sets 
which are members of themselves is equivalent to the set N of all sets which
are not members of themselves.

A.1 The First Twenty-eight Propositions of Euclid

Following  are  the  statements,  in  modern  terminology,  of  the  first  28 
propositions  of  Book  I  of  Euclid’s  Elements.  The  proofs  of  these  propositions 
do  not  require  the  parallel  postulate. 

1.  To  construct  an  equilateral  triangle  on  a  given  straight  line  segment. 

2.  From  a  given  point  to  draw  a  straight  line  segment  equal  to  a  given 
straight  line  segment. 

3.  From  the  greater  of  two  given  straight  line  segments  to  cut  off  a  part 
equal  to  the  smaller. 

4.  Two  triangles  are  congruent  if  two  sides  and  the  included  angle  of  one
are  equal  to  two  sides  and  the  included  angle  of  the  other. 

5.  The  base  angles  of  an  isosceles  triangle  are  equal. 

6.  If  two  angles  of  a  triangle  are  equal,  the  sides  opposite  these  angles 
are  equal. 

7. There cannot be two different triangles on the same side of a common
base and having their other pairs of corresponding sides equal.

8. Two triangles are congruent if the three sides of one are equal to the
three sides of the other.

9.  To  bisect  a  given  angle. 

10.  To  bisect  a  given  straight  line  segment. 

11.  To  erect  a  perpendicular  to  a  given  straight  line  at  a  given  point  on 
the  line. 

12.  To  drop  a  perpendicular  from  a  given  point  to  a  given  straight  line. 

13.  Any  pair  of  adjacent  angles  formed  by  two  intersecting  straight  lines 
are  either  two  right  angles  or  are  together  equal  to  two  right  angles. 

14.  If,  at  any  point  on  a  straight  line,  two  rays,  on  opposite  sides  of  it, 
make  a  pair  of  adjacent  angles  together  equal  to  two  right  angles,  then 
these  two  rays  lie  in  the  same  straight  line. 

15.  Any  pair  of  vertical  angles  formed  by  two  intersecting  straight  lines 
are  equal. 

16.  An  exterior  angle  of  a  triangle  is  greater  than  either  remote  interior 
angle. 

17.  Any  two  angles  of  a  triangle  are  together  less  than  two  right  angles. 

18.  In  a  triangle  the  greater  side  is  opposite  the  greater  angle. 

19.  In  a  triangle  the  greater  angle  is  opposite  the  greater  side. 

20.  The  sum  of  any  two  sides  of  a  triangle  is  greater  than  the  third  side. 

21.  If  from  the  ends  of  a  side  of  a  triangle  there  be  drawn  two  straight  line 
segments  to  a  point  within  the  triangle,  these  two  segments  together 
will  be  less  than  the  other  two  sides  together,  but  will  contain  a  greater 
angle. 

22.  To  construct  a  triangle  having  sides  equal  to  three  given  straight  line 
segments,  any  two  of  which  are  together  greater  than  the  third. 

23.  At  a  given  point  on  a  given  straight  line  to  construct  an  angle  equal  to 
a  given  angle. 

24.  If  two  triangles  have  two  sides  of  one  equal  to  two  sides  of  the  other, 
but  the  included  angle  of  the  first  is  greater  than  the  included  angle 
of  the  second,  then  the  third  side  of  the  first  is  greater  than  the  third 
side  of  the  second. 

25.  If  two  triangles  have  two  sides  of  one  equal  to  two  sides  of  the  other, 
but  the  third  side  of  the  first  is  greater  than  the  third  side  of  the  second, 
then  the  included  angle  of  the  first  is  greater  than  the  included  angle 
of  the  second. 

26.  Two  triangles  are  congruent  if  two  angles  and  a  side  of  one  are  equal 
to  two  angles  and  the  corresponding  side  of  the  other. 

27.  If  a  transversal  of  two  straight  lines  makes  a  pair  of  alternate  interior 
angles  equal,  then  the  two  lines  are  parallel. 

28.  If  a  transversal  of  two  straight  lines  makes  a  pair  of  corresponding  angles 
equal,  or  a  pair  of  interior  angles  on  the  same  side  of  the  transversal 
equal to two right angles, then the two lines are parallel.

A.2 Removal of Some Redundancies

In Section 7.2 we listed a postulate set for a complete ordered field. The
first ten of these postulates characterize a field. We shall show that in this
postulate set for a field, Postulates P1 and P9 are redundant, and can
therefore be deleted, thus reducing the postulate set for a field to one of
eight simple postulates. The deduction of Postulate P9 from Postulates P4,
P7, P10 is very simple and well known. The deduction of Postulate P1 from
P2, P3, P5, P6, P7, P8 is more difficult and not so well known. For simplicity
we shall denote the addition operation of the field by + and the multiplication
operation by juxtaposition.

Theorem 1 (this is P9). If a, b, c are elements of a field S, c is not the
zero element z of S, and ac = bc, then a = b.

For we have

a = au            (by P7)
  = a(cc^{-1})    (by P10)
  = (ac)c^{-1}    (by P4)
  = (bc)c^{-1}    (substitution)
  = b(cc^{-1})    (by P4)
  = bu            (by P10)
  = b.            (by P7)

Theorem 2 (this is P1). If a and b are elements of a field S, then a + b = b + a.

Postulates P3, P6, P8 assure us that the elements of a field constitute
a group with respect to addition. It follows (see Theorems 1 and 5 of Section
5.3) that both the right and the left cancellation laws for addition hold in
a field. But

(a + b)(u + u) = (a + b)u + (a + b)u      (by P5)
               = u(a + b) + u(a + b)      (by P2)
               = (ua + ub) + (ua + ub)    (by P5)
               = (au + bu) + (au + bu)    (by P2)
               = (a + b) + (a + b)        (by P7)
               = [(a + b) + a] + b        (by P3)
               = [a + (b + a)] + b.       (by P3)

Also

(a + b)(u + u) = (u + u)(a + b)           (by P2)
               = (u + u)a + (u + u)b      (by P5)
               = a(u + u) + b(u + u)      (by P2)
               = (au + au) + (bu + bu)    (by P5)
               = (a + a) + (b + b)        (by P7)
               = [(a + a) + b] + b        (by P3)
               = [a + (a + b)] + b.       (by P3)

It now follows that [a + (a + b)] + b = [a + (b + a)] + b, whence,
by an application of the right cancellation law for addition followed by an
application of the left cancellation law for addition, we find a + b = b + a.

The  examination  of  the  truncated  postulate  set  P2,  P3,  P4,  P5,  P6,  P7, 
P8,  P10  of  Section  7.2  for  independence  or  for  further  possible  redundancies 
constitutes  a  study  that  is  recommended  to  the  interested  reader. 


In  Section  7.3  we  listed  a  postulate  set  for  the  natural  number  system, 
and remarked that Postulates N1, N2, N4, N7, and N8 of the set are
redundant. It is interesting that the redundancy of N7 and N8 was observed
apparently only recently.* We now proceed to deduce Postulates N1, N2,
N4, N7, and N8 from the remaining postulates N3, N5, N6, N9, and N10.

Theorem 3. If n is a natural number, then n + 1 = 1 + n.

Let M be the set of all natural numbers n such that n + 1 = 1 + n.
Clearly 1 is in M. Suppose k is in M; that is, suppose k + 1 = 1 + k. Then

(k + 1) + 1 = (1 + k) + 1    (by supposition)
            = 1 + (k + 1),   (by N3)

and  k  +  1  is  in  M.  It  follows,  by  N10,  that  M  contains  all  the  natural 
numbers,  and  the  theorem  is  established. 


Theorem 4 (this is N1). If a and b are any natural numbers, then a + b = b + a.

Let M be the set of all natural numbers b for which a + b = b + a.
By Theorem 3, 1 is in M. Suppose k is in M; that is, suppose a + k = k + a.
Then

a + (k + 1) = (a + k) + 1    (by N3)
            = (k + a) + 1    (by supposition)
            = k + (a + 1)    (by N3)
            = k + (1 + a)    (by Theorem 3)
            = (k + 1) + a,   (by N3)

and  k  +  1  is  in  M.  It  follows,  by  N10,  that  M  contains  all  the  natural 
numbers,  and  the  theorem  is  established. 


Theorem 5. If n is any natural number, then 1n = n.

Let M be the set of all natural numbers n such that 1n = n. Clearly
1 is in M, by N6. Suppose k is in M; that is, suppose 1k = k. Then

1(k + 1) = 1k + 1·1    (by N5)
         = k + 1·1     (by supposition)
         = k + 1,      (by N6)

and k + 1 is in M. It follows, by N10, that M contains all the natural
numbers, and the theorem is established.


*See P. H. Diananda (Bibliography).

Theorem 6. If a, b, c are any natural numbers, then (b + c)a = ba + ca.

Let M be the set of all natural numbers a such that (b + c)a = ba + ca.
Now

(b + c)1 = b + c      (by N6)
         = b1 + c1,   (by N6)

and 1 is in M. Suppose k is in M; that is, suppose (b + c)k = bk + ck. Then

(b + c)(k + 1) = (b + c)k + (b + c)1      (by N5)
               = (bk + ck) + (b + c)1     (by supposition)
               = (bk + ck) + (b + c)      (by N6)
               = [(bk + ck) + b] + c      (by N3)
               = [bk + (ck + b)] + c      (by N3)
               = [bk + (b + ck)] + c      (by Theorem 4)
               = [(bk + b) + ck] + c      (by N3)
               = (bk + b) + (ck + c)      (by N3)
               = (bk + b1) + (ck + c1)    (by N6)
               = b(k + 1) + c(k + 1),     (by N5)

and k + 1 is in M. It follows, by N10, that M contains all the natural
numbers, and the theorem is established.

Theorem 7 (this is N2). If a and b are any natural numbers, then ab = ba.

Let M be the set of all natural numbers b such that ab = ba. Now

a1 = a      (by N6)
   = 1a,    (by Theorem 5)

and 1 is in M. Suppose k is in M; that is, suppose ak = ka. Then

a(k + 1) = ak + a1     (by N5)
         = ka + a1     (by supposition)
         = ka + a      (by N6)
         = ka + 1a     (by Theorem 5)
         = (k + 1)a,   (by Theorem 6)

and k + 1 is in M. It follows, by N10, that M contains all the natural
numbers, and the theorem is established.


Theorem 8 (this is N4). If a, b, c are any natural numbers, then (ab)c = a(bc).

Let M be the set of all natural numbers c such that (ab)c = a(bc). Now

(ab)1 = ab       (by N6)
      = a(b1),   (by N6)

and 1 is in M. Suppose k is in M; that is, suppose (ab)k = a(bk). Then

(ab)(k + 1) = (ab)k + (ab)1     (by N5)
            = a(bk) + (ab)1     (by supposition)
            = a(bk) + a(b1)     (since 1 is in M)
            = a[bk + b1]        (by N5)
            = a[b(k + 1)],      (by N5)


and k + 1 is in M. It follows, by N10, that M contains all the natural
numbers, and the theorem is established.

Theorem 9 (this is N7). If a, b, c are natural numbers and if c + a = c + b,
then a = b.

By N9 one and only one of the following holds: a = b, a + x = b,
a = b + y, where x and y are natural numbers.

Suppose a + x = b. Then

c + b = c + (a + x)    (by supposition)
      = (c + a) + x.   (by N3)

Since, by hypothesis, c + b = c + a, the above result contradicts N9.

Suppose a = b + y. Then

c + a = c + (b + y)    (by supposition)
      = (c + b) + y.   (by N3)

Since, by hypothesis, c + a = c + b, the above result contradicts N9.

It now follows, by reductio ad absurdum, that a = b.

Theorem  10  (this  is  N8).  If  a,  b,  c  are  natural  numbers  and  if  ca  =  cb, 
then  a  =  b. 

By N9 one and only one of the following holds: a = b, a + x = b,
a = b + y, where x and y are natural numbers.

Suppose a + x = b. Then

cb = c(a + x)    (by supposition)
   = ca + cx.    (by N5)

Since, by hypothesis, cb = ca, the above result contradicts N9.

Suppose a = b + y. Then

ca = c(b + y)    (by supposition)
   = cb + cy.    (by N5)

Since, by hypothesis, ca = cb, the above result contradicts N9.

It now follows, by reductio ad absurdum, that a = b.

In  view  of  Theorems  4,  7,  8,  9,  10  it  follows  that  the  natural  number 
system  can  be  made  to  rest  on  the  five  postulates  N3,  N5,  N6,  N9,  N10. 
In  fact,  we  can  even  weaken  N5  to  N5': 

N5': If a and b are in N, then a(b + 1) = ab + a1.

To  prove  this  we  deduce  N5  from  N3,  N5',  and  N10. 

Theorem  11.  N5  is  implied  by  N3,  N5',  and  N10. 

Let M be the set of all natural numbers c such that a(b + c) = ab + ac.
Clearly, by N5', 1 is in M. Suppose k is in M; that is, suppose a(b + k) =
ab + ak. Then

a[b + (k + 1)] = a[(b + k) + 1]     (by N3)
               = a(b + k) + a1      (by N5')
               = (ab + ak) + a1     (by supposition)
               = ab + (ak + a1)     (by N3)
               = ab + a(k + 1),     (by N5')

and  k  +  1  is  in  M.  It  follows,  by  N10,  that  M  contains  all  the  natural 
numbers, and the theorem is established.

It is interesting that the reduced postulate set N3, N6, N5', N9, N10
for the natural number system is independent. Perhaps the reader might
care to show this by considering the following five interpretations, wherein
we designate the addition operation by ⊕, the multiplication operation by
⊗, and the unity element by u.

Independence of N3. Let N be the set of natural numbers with
a ⊕ b defined as α + b, where α = a if b < 5, α = 5[a/5] if b ≥ 5,
where [x] denotes the greatest integer not exceeding x,
a ⊗ b defined as (((a ⊕ a) ⊕ a) ⊕ ...) ⊕ a, where there are b a's,
u taken as 1.

Independence of N6. Let N be the set of even natural numbers with
a ⊕ b defined as a + b,
a ⊗ b defined as ab,
u taken as 2.

Independence of N5'. Let N be the set of natural numbers with
a ⊕ b defined as a + b,
a ⊗ b defined as a,
u taken as 1.

Independence of N9. Let N be the set {0, 1, 2, 3} with
a ⊕ b defined as a + b (mod 4),
a ⊗ b defined as ab (mod 4),
u taken as 1.

Independence of N10. Let N be the set of all real numbers with
a ⊕ b defined as a + b,
a ⊗ b defined as ab,
u taken as 1.

The reader may further care to try to show the complete independence
of Postulates N3, N6, N5', N9, N10.
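
Three of the interpretations above are small enough to probe by machine. The following is only a sketch under stated assumptions: the forms of N3 (associativity of addition), N6 (a·u = a), and N9 (exactly one of a = b, a + x = b, a = b + y) are inferred from the way these postulates are used in the proofs above, since Section 7.3 is not reproduced here, and the function names are our own. For the two infinite models only a finite sample is tested, so a result of True is evidence rather than proof; N10 is not tested at all.

```python
from itertools import product

def holds_N3(S, add):
    """Assumed form of N3: (a + b) + c = a + (b + c)."""
    return all(add(add(a, b), c) == add(a, add(b, c))
               for a, b, c in product(S, repeat=3))

def holds_N6(S, mul, u):
    """Assumed form of N6: a * u = a."""
    return all(mul(a, u) == a for a in S)

def holds_N5p(S, add, mul, u):
    """N5' as stated in the text: a(b + u) = ab + au."""
    return all(mul(a, add(b, u)) == add(mul(a, b), mul(a, u))
               for a, b in product(S, repeat=2))

def holds_N9(S, add):
    """Assumed form of N9: exactly one of a = b, a + x = b, a = b + y."""
    for a, b in product(S, repeat=2):
        cases = [a == b,
                 any(add(a, x) == b for x in S),
                 any(a == add(b, y) for y in S)]
        if sum(cases) != 1:
            return False
    return True

def report(S, add, mul, u):
    """Check the four finitely testable postulates on the sample S."""
    return {"N3": holds_N3(S, add), "N6": holds_N6(S, mul, u),
            "N5'": holds_N5p(S, add, mul, u), "N9": holds_N9(S, add)}

plus = lambda a, b: a + b
times = lambda a, b: a * b

# Even natural numbers with u = 2 (finite sample): only N6 should fail.
model_N6 = report(range(2, 26, 2), plus, times, 2)
# Natural numbers with a (x) b = a (finite sample): only N5' should fail.
model_N5p = report(range(1, 13), plus, lambda a, b: a, 1)
# Integers mod 4 (the whole model): only N9 should fail.
model_N9 = report(range(4), lambda a, b: (a + b) % 4,
                  lambda a, b: (a * b) % 4, 1)

for name, result in [("N6", model_N6), ("N5'", model_N5p), ("N9", model_N9)]:
    print("model for", name, result)
```

In each report the postulate whose independence is being shown should be the only one marked False.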

A.3 Euclidean Constructions

We  noted,  in  Section  2.3,  that  the  first  three  of  Euclid’s  postulates  state 
the  primitive  constructions  from  which  all  other  constructions  in  the  Elements 
are  to  be  compounded.  The  first  two  of  these  postulates  tell  us  what  we  can 
do  with  a  Euclidean  straightedge;  we  are  permitted  to  draw  as  much  as  may 
be  desired  of  the  straight  line  determined  by  two  given  points.  The  third 
postulate tells us what we can do with Euclidean compasses; we are permitted
to draw the circle of given center and passing through a given point.
Note that neither instrument is to be used for transferring distances. This
means that the straightedge cannot be marked, and the compasses must be
regarded as having the characteristic that if one or both points be lifted
from the paper, the instrument immediately collapses. For this reason,
Euclidean compasses are often referred to as collapsing compasses; they differ
from modern compasses, which retain their opening and hence can be used as
dividers for transferring distances. It would seem that modern compasses
might be more powerful than the collapsing compasses. Curiously enough,
such turns out not to be true; any construction performable with the modern
compasses can also be carried out (in perhaps a longer way) by means of
the collapsing compasses. We prove this fact as our first theorem.

Theorem  1.  The  collapsing  and  modern  compasses  are  equivalent. 

The circle with center O and passing through a given point C will be
denoted by O(C), and the circle with center O and radius equal to a given
segment AB will be denoted by O(AB). To prove the theorem it suffices to
show that we may, with collapsing compasses, construct any circle O(AB).
This may be accomplished as follows (see Figure 45). Draw circles A(O) and
O(A) to intersect in D and E; draw circles D(B) and E(B) to intersect again
in F; draw circle O(F). It is an easy matter to prove that OF = AB, whence
circle O(F) is circle O(AB).

FIG. 45
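
The construction in the proof can be checked numerically. The sketch below is not part of the original argument: the helper names and the sample coordinates of O, A, B are our own, and floating-point arithmetic stands in for exact geometry. It assumes that B does not lie on the line DE, so that the circles D(B) and E(B) meet in two distinct points.

```python
import math

def dist(p, q):
    """Euclidean distance between two points given as (x, y) pairs."""
    return math.hypot(p[0] - q[0], p[1] - q[1])

def circle_intersections(c1, r1, c2, r2):
    """Two intersection points of circles (c1, r1) and (c2, r2).

    Assumes the circles actually intersect and have distinct centers.
    """
    (x1, y1), (x2, y2) = c1, c2
    d = dist(c1, c2)
    a = (r1 ** 2 - r2 ** 2 + d ** 2) / (2 * d)   # distance from c1 to the common chord
    h = math.sqrt(max(r1 ** 2 - a ** 2, 0.0))    # half the length of the common chord
    mx, my = x1 + a * (x2 - x1) / d, y1 + a * (y2 - y1) / d
    ox, oy = h * (y2 - y1) / d, h * (x2 - x1) / d
    return (mx + ox, my - oy), (mx - ox, my + oy)

def transfer_radius(O, A, B):
    """Return the point F of the construction, for which OF = AB."""
    r = dist(O, A)
    D, E = circle_intersections(A, r, O, r)            # circles A(O) and O(A)
    P, Q = circle_intersections(D, dist(D, B), E, dist(E, B))
    # Circles D(B) and E(B) meet in B and in F; keep the point that is not B.
    return Q if dist(P, B) < dist(Q, B) else P

O, A, B = (0.0, 0.0), (4.0, 1.0), (6.0, 5.0)   # arbitrary sample points
F = transfer_radius(O, A, B)
print(dist(O, F), dist(A, B))   # the two lengths should agree
```

The agreement reflects the reason behind the "easy matter" of the proof: DE is the perpendicular bisector of OA, and F is the mirror image of B in DE, so the reflection carries segment AB onto segment OF.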

Note.  In  view  of  Theorem  1,  we  may  dispense  with  the  Euclidean, 
or  collapsing,  compasses,  and  in  their  place  employ  the  simpler  modern 
compasses.  We  are  assured  that  the  set  of  constructions  performable  with 
straightedge  and  modern  compasses  is  the  same  as  the  set  performable  with 
straightedge  and  Euclidean  compasses. 

We  now  proceed  to  establish  a  chain  of  theorems  which  will  furnish 
us  with  a  criterion  for  Euclidean  constructibility.  It  turns  out  that  the 
criterion  is  algebraic  in  nature.  After  the  criterion  is  established,  we  shall 
apply  it  to  prove  that  the  three  famous  construction  problems  of  antiquity 
(trisection  of  a  general  angle,  duplication  of  a  cube,  and  squaring  of  a  circle) 
are impossible with Euclidean tools. A proof of this fact was not discovered
until the nineteenth century, more than two thousand years after the original
problems had been proposed.*

Theorem 2. Given line segments of lengths a and b, and a unit segment, we
may construct with Euclidean tools segments of lengths $a + b$, $a - b$, $ab$, $a/b$,
and $\sqrt{a}$.


The  first  two  constructions  are  trivial;  the  last  three  are  apparent  from 
Figures  46,  47,  and  48. 


Definition 1. Let $a_1, \ldots, a_n$ be a given set of distinct real nonzero
numbers, and let $F_0$ denote the set of all numbers each of which can be
obtained from $a_1, \ldots, a_n$ by a finite number of additions, subtractions,
multiplications, and permissible divisions. Since $F_0$ is closed under addition,
subtraction, multiplication, and permissible division, $F_0$ is a number field
(see Section 5.1); it will be called the number field generated by $a_1, \ldots, a_n$.
The number field generated by the number 1 will be denoted by $R_0$.

Theorem 3. $R_0$ is the field of all rational numbers. Every number field contains
$R_0$ as a subfield.

The  proof  is  simple  and  is  left  to  the  reader. 

Definition 2. Let $w_1$ be a positive number in $F_0$ (generated by
$a_1, \ldots, a_n$) such that $\sqrt{w_1}$ is not in $F_0$. Then the number field $F_1$ generated by
$a_1, \ldots, a_n, \sqrt{w_1}$ will be called a square root extension of field $F_0$. Let $F_0, F_1,
F_2, \ldots, F_m$ be a sequence of number fields such that $F_i$ is a square root
extension of $F_{i-1}$ ($i = 1, \ldots, m$). Then this sequence of fields will be called
a square root extension chain of $F_0$, and the fields $F_0, F_1, \ldots, F_m$ will be called
the links of the chain. We shall let $\sqrt{w_i}$ denote the radical that extends
$F_{i-1}$ to $F_i$.

Theorem 4. Every number of $F_i$ ($i \ge 1$) has the form $r + s\sqrt{w_i}$, where
r and s are in $F_{i-1}$, and, conversely, every number of this form is in $F_i$.

*For a history of these problems and allied topics, see H. Eves, Chap. 4.

The generators of $F_i$ are clearly of the required form. To prove the
direct part of the theorem it then suffices to show that if A and B are of
the required form, then so are $A + B$, $A - B$, $AB$, and $A/B$ ($B \ne 0$).
To this end set

$$A = a + b\sqrt{w_i}, \qquad B = c + d\sqrt{w_i},$$

where a, b, c, d are in $F_{i-1}$. Then

$$\begin{aligned}
A + B &= (a + c) + (b + d)\sqrt{w_i},\\
A - B &= (a - c) + (b - d)\sqrt{w_i},\\
AB &= (ac + bdw_i) + (ad + bc)\sqrt{w_i},\\
A/B &= (a + b\sqrt{w_i})/(c + d\sqrt{w_i})\\
    &= (a + b\sqrt{w_i})(c - d\sqrt{w_i})/[(c + d\sqrt{w_i})(c - d\sqrt{w_i})]\\
    &= [(ac - bdw_i)/(c^2 - d^2w_i)] + [(bc - ad)/(c^2 - d^2w_i)]\sqrt{w_i},
\end{aligned}$$

and we see that $A + B$, $A - B$, $AB$, and $A/B$ ($B \ne 0$) are of the required
form.

The proof of the converse part of the theorem is trivial.
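
The closure computation in the proof of Theorem 4 can be mirrored in a few lines of code. The following is a minimal sketch, assuming the base field is the rational field and w is a fixed positive rational that is not a perfect square; the class name SqrtExt and the sample values are our own illustrations and do not appear in the text.

```python
from fractions import Fraction

class SqrtExt:
    """A number r + s*sqrt(w) with rational r, s and a fixed rational w."""

    def __init__(self, r, s, w):
        self.r, self.s, self.w = Fraction(r), Fraction(s), Fraction(w)

    def __add__(self, other):
        return SqrtExt(self.r + other.r, self.s + other.s, self.w)

    def __sub__(self, other):
        return SqrtExt(self.r - other.r, self.s - other.s, self.w)

    def __mul__(self, other):
        # (a + b*sqrt(w))(c + d*sqrt(w)) = (ac + bdw) + (ad + bc)*sqrt(w)
        return SqrtExt(self.r * other.r + self.s * other.s * self.w,
                       self.r * other.s + self.s * other.r, self.w)

    def __truediv__(self, other):
        # Multiply numerator and denominator by the conjugate c - d*sqrt(w).
        den = other.r ** 2 - other.s ** 2 * self.w
        return SqrtExt((self.r * other.r - self.s * other.s * self.w) / den,
                       (self.s * other.r - self.r * other.s) / den, self.w)

    def __eq__(self, other):
        return (self.r, self.s, self.w) == (other.r, other.s, other.w)

    def __repr__(self):
        return f"{self.r} + {self.s}*sqrt({self.w})"

A = SqrtExt(1, 2, 2)     # 1 + 2*sqrt(2)
B = SqrtExt(3, -1, 2)    # 3 - sqrt(2)
quotient = A / B
print(quotient)          # prints 1 + 1*sqrt(2)
```

Every result is again a pair of rationals attached to the same radical, which is exactly the content of the direct part of the theorem for this one extension.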

Theorem 5. Any number in $F_m$ is a root of a polynomial of degree $2^r$ with
coefficients in $F_{m-r}$, $r = 1, 2, \ldots, m$.

Let x be a number in $F_m$. Then, by Theorem 4, $x = a + b\sqrt{w_m}$, where
a and b are in $F_{m-1}$. It follows that

$$x - a = b\sqrt{w_m} \quad\text{or}\quad x^2 - 2ax + (a^2 - b^2w_m) = 0,$$

and x is a root of a quadratic polynomial with coefficients in $F_{m-1}$.

We now have $x^2 + px + q = 0$, where p and q are numbers in $F_{m-1}$.
By Theorem 4, $p = c + d\sqrt{w_{m-1}}$, $q = e + f\sqrt{w_{m-1}}$, where c, d, e, f are in
$F_{m-2}$. It follows that

$$x^2 + (c + d\sqrt{w_{m-1}})x + (e + f\sqrt{w_{m-1}}) = 0,$$

or

$$(x^2 + cx + e)^2 - (dx + f)^2w_{m-1} = 0,$$

and x is a root of a quartic polynomial with coefficients in $F_{m-2}$.

We now have $x^4 + rx^3 + sx^2 + tx + u = 0$, where r, s, t, u are numbers
in $F_{m-2}$. By Theorem 4, $r = g + h\sqrt{w_{m-2}}$, $s = j + k\sqrt{w_{m-2}}$, $t = l + n\sqrt{w_{m-2}}$,
$u = p + q\sqrt{w_{m-2}}$, where g, h, j, k, l, n, p, q are in $F_{m-3}$. Substituting these
values for r, s, t, u in the quartic equation, transposing the terms containing
the radical $\sqrt{w_{m-2}}$ to one side of the equation, and then squaring both sides,
we find that x is a root of an octic polynomial with coefficients in $F_{m-3}$.

A  step-by-step  continuation  of  the  above  process  establishes  the  theorem. 
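
As an illustration of the first two steps, take the chain of our own choosing with $w_1 = 2$ and $w_2 = 1 + \sqrt{2}$, and the number $x = 1 + \sqrt{w_2}$ in $F_2$. Following the proof, x satisfies the quadratic $x^2 - 2x - \sqrt{2} = 0$ over $F_1$, and then the quartic $(x^2 - 2x)^2 - 2 = 0$, that is, $x^4 - 4x^3 + 4x^2 - 2 = 0$, over the rational field. The sketch below merely confirms these two equations in floating-point arithmetic; it is a numerical check, not a proof.

```python
import math

w1 = 2.0
w2 = 1.0 + math.sqrt(w1)       # a positive number of F_1
x = 1.0 + math.sqrt(w2)        # a number of F_2, with a = 1 and b = 1

# Step 1: x^2 - 2ax + (a^2 - b^2 w_2) = x^2 - 2x - sqrt(2), coefficients in F_1.
quadratic = x ** 2 - 2 * x - math.sqrt(w1)

# Step 2: here p = -2 (c = -2, d = 0) and q = -sqrt(2) (e = 0, f = -1), so
# (x^2 + cx + e)^2 - (dx + f)^2 w_1 = (x^2 - 2x)^2 - 2, coefficients rational.
quartic = x ** 4 - 4 * x ** 3 + 4 * x ** 2 - 2

print(quadratic, quartic)      # both should be zero up to rounding error
```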

Definition 3. A point of the Cartesian plane is said to belong to $F_i$ if
its coordinates are in $F_i$, and a straight line or a circle is said to belong to $F_i$
if the coefficients in the equation of the straight line or of the circle are
proportional to numbers in $F_i$.

Theorem 6. (1) If points P and Q belong to $F_i$, then the line PQ belongs
to $F_i$. (2) If two distinct intersecting lines belong to $F_i$, then their point of intersection
belongs to $F_i$. (3) If points P and Q belong to $F_i$, then the circle P(Q) belongs to $F_i$.
(4) If an intersecting line and circle belong to $F_i$, then their points of intersection
belong to $F_i$, or to some square root extension $F_{i+1}$ of $F_i$. (5) If two distinct
intersecting circles belong to $F_i$, then their points of intersection belong to $F_i$ or to some
square root extension $F_{i+1}$ of $F_i$.

1. Let $P = (a,b)$, $Q = (c,d)$, where a, b, c, d are in $F_i$. Then the equation
of PQ is

$$(b - d)x + (c - a)y + (ad - bc) = 0,$$

the coefficients of which are in $F_i$.

2. Let the equations of the lines be

$$ax + by + c = 0 \quad\text{and}\quad dx + ey + f = 0,$$

where a, b, c, d, e, f are in $F_i$ and, to guarantee intersection, $ae - bd \ne 0$.
Then the coordinates of the point of intersection of the lines are

$$x = \frac{bf - ce}{ae - bd}, \qquad y = \frac{dc - af}{ae - bd},$$

which are in $F_i$.

3. Let $P = (a,b)$, $Q = (c,d)$, where a, b, c, d are in $F_i$. Then the equation
of circle P(Q) is

$$x^2 + y^2 - 2ax - 2by - (c^2 + d^2 - 2ac - 2bd) = 0,$$

the coefficients of which are in $F_i$.

4. Let the equations of the line and circle be

$$ax + by + c = 0 \quad\text{and}\quad x^2 + y^2 + gx + hy + k = 0,$$

where a, b, c, g, h, k are in $F_i$. Then the coordinates of the points of
intersection of the line and circle are given by

$$x = \frac{-A \pm b\sqrt{R}}{2M}, \qquad y = \frac{-B \mp a\sqrt{R}}{2M},$$

where $M = a^2 + b^2$ and

$$A = 2ac + b^2g - abh, \qquad B = 2bc + a^2h - abg,$$

$$R = a^2h^2 + b^2g^2 - 4a^2k - 4b^2k - 4c^2 + 4acg + 4bch - 2abgh.$$

These coordinates are in $F_i$ (if the radical belongs to $F_i$) or in some
square root extension $F_{i+1}$ of $F_i$ (if the radical does not belong to $F_i$).

5. Let the equations of the circles be

$$x^2 + y^2 + gx + hy + k = 0 \quad\text{and}\quad x^2 + y^2 + px + qy + r = 0,$$

where g, h, k, p, q, r are in $F_i$. Then the points of intersection of the
two circles are the points of intersection of either circle with their
radical axis

$$(g - p)x + (h - q)y + (k - r) = 0,$$

whose coefficients are in $F_i$. Part (5) is now reduced to part (4).
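
The formulas of part (4) lend themselves to a direct numerical check. The sketch below, with a function name and sample coefficients of our own choosing, computes the two intersection points from A, B, M, and R exactly as defined above and substitutes them back into both equations. Floating-point numbers are used, so the residuals vanish only up to rounding error.

```python
import math

def line_circle_points(a, b, c, g, h, k):
    """Intersections of ax + by + c = 0 with x^2 + y^2 + gx + hy + k = 0.

    Returns an empty list when R < 0, that is, when the curves do not meet.
    """
    M = a * a + b * b
    A = 2 * a * c + b * b * g - a * b * h
    B = 2 * b * c + a * a * h - a * b * g
    R = (a * a * h * h + b * b * g * g - 4 * a * a * k - 4 * b * b * k
         - 4 * c * c + 4 * a * c * g + 4 * b * c * h - 2 * a * b * g * h)
    if R < 0:
        return []
    s = math.sqrt(R)
    # The signs in x and y are opposite, so that ax + by + c stays zero.
    return [((-A + b * s) / (2 * M), (-B - a * s) / (2 * M)),
            ((-A - b * s) / (2 * M), (-B + a * s) / (2 * M))]

a, b, c = 1.0, 2.0, -3.0       # sample line  x + 2y - 3 = 0
g, h, k = -2.0, 4.0, -20.0     # sample circle with center (1, -2), radius 5
points = line_circle_points(a, b, c, g, h, k)
residuals = [(a * x + b * y + c, x * x + y * y + g * x + h * y + k)
             for x, y in points]
print(points)
print(residuals)               # every entry should be close to zero
```

Only one square root, that of R, enters the coordinates, which is why at most a single square root extension of $F_i$ is needed.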

Theorem 7. Let $a_1, \ldots, a_n$ be the distinct nonzero Cartesian coordinates of a
set of given points. Then any point which can be constructed from the given points by
straightedge alone belongs to the field $F_0$ generated by $a_1, \ldots, a_n$.

This is a consequence of Theorem 6.

Note. The converse of this theorem is not true. That is, there are
points in $F_0$ which cannot be constructed from the given points with
straightedge alone.

Theorem 8. Let $a_1, \ldots, a_n$ be the distinct nonzero Cartesian coordinates of a
set of given points. Then any point which can be constructed from the given points by
the Euclidean tools belongs to some square root extension chain of the field $F_0$ generated
by $a_1, \ldots, a_n$. Conversely, any point belonging to a square root extension chain of $F_0$
can be constructed with Euclidean tools.

The direct part of the theorem is a consequence of Theorem 6. The
converse part of the theorem follows from Theorems 4 and 2.

Definition 4. The set F of all numbers belonging to square root extension
chains of $F_0$ is known as the constructible set generated by $a_1, \ldots, a_n$. The
constructible set generated by the number 1 will be designated by R.

Definition 5. A number is said to be algebraic if it is a root of a polynomial
equation with coefficients in $R_0$.*

Theorem  9.  All  numbers  in  R  are  algebraic. 

This  is  an  immediate  consequence  of  Theorem  5  and  Definition  5. 

*The equivalence of this definition with that given in Section 8.4 is the subject
matter of Problem 8.4.4 (g).

Theorem 10. If a cubic polynomial equation with rational coefficients has no
rational root, then none of its roots belongs to R.

Let the cubic equation be

$$x^3 + ax^2 + bx + c = 0,$$

where a, b, c are in $R_0$ but x is not. Suppose x is in R. Then x lies in the
last link $R_k$ of some square root extension chain $R_0, R_1, \ldots, R_k$ of the rational
field $R_0$. Let k be the smallest integer such that a root of the cubic lies in a
square root extension link, like $R_k$, of $R_0$. Since x is not in $R_0$, $k > 0$. We then
have (Theorem 4) $x = p + q\sqrt{w_k}$, where p, q, $w_k$ are in $R_{k-1}$, but $\sqrt{w_k}$ is
not in $R_{k-1}$. Substituting $x = p + q\sqrt{w_k}$ in the cubic equation, we find
$P + Q\sqrt{w_k} = 0$, where

$$P = p^3 + 3pq^2w_k + ap^2 + aq^2w_k + bp + c,$$

$$Q = 3p^2q + q^3w_k + 2apq + bq.$$

Since P and Q are in $R_{k-1}$, whereas $\sqrt{w_k}$ is not in $R_{k-1}$, it follows that we
must have $P = Q = 0$, and hence $P - Q\sqrt{w_k} = 0$. But, if we replace x by
$p - q\sqrt{w_k}$ in the left member of the cubic equation, we obtain $P - Q\sqrt{w_k}$.
Therefore $y = p - q\sqrt{w_k}$ is also a root of the cubic equation. Since $q \ne 0$
(inasmuch as otherwise x would be in $R_{k-1}$), we see that $x \ne y$. Let z be
the third root of the cubic equation. Then, from the elementary theory of
equations, $x + y + z = -a$, or

$$z = -a - x - y = -a - 2p,$$

a number in $R_{k-1}$. But this is a contradiction of the assumption that k is the
smallest integer such that a root of the cubic equation lies in a square root
extension link, like $R_k$, of $R_0$. The theorem now follows by reductio ad absurdum.

Theorem 11. Given a line segment AB, it is impossible to construct with
Euclidean tools a line segment CD such that $(CD)^3 = 2(AB)^3$.

FIG. 49

Place a Cartesian frame of reference on the plane with origin at A and
with positive x-axis along AB. Choose the scale so that AB is one unit. We
are then given the points (0,0) and (1,0). Now assume the construction is
possible. Then we can construct with Euclidean tools the point E on the
positive x-axis such that $AE = CD = \sqrt[3]{2}\,(AB) = \sqrt[3]{2}$. That is, we can
construct the point $(\sqrt[3]{2}, 0)$, whence $\sqrt[3]{2}$ is in the constructible set R generated
by 1. But $\sqrt[3]{2}$ is a root of the cubic equation $x^3 - 2 = 0$. Since this cubic
has rational coefficients and no rational root,* Theorem 10 says that $\sqrt[3]{2}$ is
not in R. The contradiction proves the theorem.

Note.  Theorem  11  shows  that  it  is  impossible  to  construct  with 
Euclidean  tools  the  edge  of  a  cube  having  exactly  twice  the  volume  of  a 
given  cube,  and  thus  disposes  of  the  ancient  problem  of  duplicating  the  cube. 
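
The test for rational roots invoked in the proof of Theorem 11 (and described in its footnote) is easy to mechanize. The following is a sketch only; the function names are ours, the coefficients are assumed to be integers listed from the highest power down, and both the leading and the constant coefficient are assumed to be nonzero.

```python
from fractions import Fraction

def divisors(n):
    """Positive divisors of a nonzero integer n."""
    n = abs(n)
    return [d for d in range(1, n + 1) if n % d == 0]

def rational_roots(coeffs):
    """Rational roots of a_0 x^n + a_1 x^(n-1) + ... + a_n = 0.

    Every reduced rational root p/q has p dividing a_n and q dividing a_0,
    so it suffices to test that finite list of candidates exactly.
    """
    a0, an = coeffs[0], coeffs[-1]
    found = set()
    for p in divisors(an):
        for q in divisors(a0):
            for candidate in (Fraction(p, q), Fraction(-p, q)):
                value = Fraction(0)
                for coefficient in coeffs:      # Horner evaluation
                    value = value * candidate + coefficient
                if value == 0:
                    found.add(candidate)
    return sorted(found)

roots_of_duplication_cubic = rational_roots([1, 0, 0, -2])   # x^3 - 2
roots_of_comparison_cubic = rational_roots([1, 0, 0, -8])    # x^3 - 8
print(roots_of_duplication_cubic)   # an empty list: no rational root
print(roots_of_comparison_cubic)    # contains the single root 2
```

The second cubic is included only for contrast: doubling the edge of a cube multiplies its volume by eight, and that edge is of course constructible.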

*If  a  polynomial  equation 

aoxn  +  aixn~* l  T  •••+«„  =  0 

with  integral  coefficients  ao,  a\,  ■  ■  an  has  a  reduced  rational  root  ajb,  then  a  is  a 
factor  of  an  and  b  is  a  factor  of  ao.  Thus  any  rational  roots  of  x3  2  =0  are  among 

1  ?  —1,2,  —2.  Since,  by  direct  testing,  none  of  these  numbers  satisfies  the  equation, 

the  equation  has  no  rational  roots. 
Theorem 12. It is impossible to construct with Euclidean tools the trisectors
of a 60° angle.


FIG.  50 

Starting with the two points O = (0,0) and A = (1,0), we may construct
an angle AOB = 60° by drawing circles O(A) and A(O) to intersect in B.
Now assume that it is possible with Euclidean tools to draw the trisectors
of angle AOB. Let the trisector adjacent to OA cut circle O(A) in C. Find,
with Euclidean tools, the foot D of the perpendicular from C upon OA, and
denote the coordinates of D by (x,0). Then x, which is equal to cos 20°, is
in the constructible set R generated by 1.

Now, from the trigonometric identity

\[ \cos\theta = 4\cos^3\frac{\theta}{3} - 3\cos\frac{\theta}{3}, \]

we obtain, by taking $\theta = 60^\circ$,

\[ 8x^3 - 6x - 1 = 0. \]

It is easily verified that this cubic equation with rational coefficients has no
rational root. Hence, by Theorem 10, x cannot lie in R, and the theorem
follows by reductio ad absurdum.
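
The rational-root test quoted in the footnote to Theorem 11 can be carried out mechanically for both cubics met so far. The following is only a sketch of that check: the function names `divisors` and `rational_roots` are our own, and the final line merely confirms numerically (in floating point, so not as a proof) that cos 20° satisfies the trisection cubic.

```python
from fractions import Fraction
from math import cos, radians, isclose

def divisors(n):
    """Positive divisors of a nonzero integer n."""
    n = abs(n)
    return [d for d in range(1, n + 1) if n % d == 0]

def rational_roots(coeffs):
    """Rational roots of a_0 x^n + ... + a_n (integers, a_0 and a_n nonzero).

    A reduced rational root a/b must have a dividing a_n and b dividing a_0,
    so only finitely many candidates need to be tested.
    """
    a0, an = coeffs[0], coeffs[-1]
    candidates = {Fraction(s * a, b)
                  for a in divisors(an) for b in divisors(a0) for s in (1, -1)}

    def value(x):
        result = Fraction(0)
        for c in coeffs:              # Horner's rule in exact arithmetic
            result = result * x + c
        return result

    return sorted(x for x in candidates if value(x) == 0)

doubling_cubic = [1, 0, 0, -2]        # x^3 - 2        (Theorem 11)
trisection_cubic = [8, 0, -6, -1]     # 8x^3 - 6x - 1  (Theorem 12)

print(rational_roots(doubling_cubic))     # []
print(rational_roots(trisection_cubic))   # []

x = cos(radians(20))
print(isclose(8 * x**3 - 6 * x - 1, 0, abs_tol=1e-12))   # True
```

Exact fractions are used for the candidate test so that "no rational root" is established without rounding error; only the cos 20° check relies on floating point.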

Note.  Since  it  is  impossible  with  Euclidean  tools  to  trisect  an  angle  of 
60°,  it  follows  that  there  is  no  general  procedure,  using  Euclidean  tools,  for 
trisecting  an  arbitrary  given  angle.  Theorem  12  disposes  of  the  ancient 
problem  of  trisecting  an  angle.  Note  that  we  have  not  proved  that  no  angle 
can  be  trisected  with  Euclidean  tools,  but  only  that  not  all  angles  can  be  so 
trisected.  As  a  matter  of  fact,  an  angle  of  90°,  and  an  infinite  number  of 
other  angles,  can  be  trisected  with  Euclidean  tools. 
Theorem 13. It is impossible to construct with Euclidean tools a regular
polygon of nine sides.

If  such  a  polygon  could  be  constructed,  then  the  central  angle  of  the 
polygon  could  also  be  constructed.  But  the  central  angle  of  a  regular  polygon 
of  nine  sides  is  40°  =  (2/3)60°,  and  a  60°  angle  could  then  be  trisected. 

Theorem  14.  It  is  impossible  to  construct  with  Euclidean  tools  an  angle  of  1°. 

If  an  angle  of  1°  could  be  constructed,  then  an  angle  of  20°  could  be 
constructed.  But  we  have  seen  that  this  is  impossible. 

Theorem  15.  It  is  impossible  to  construct  with  Euclidean  tools  a  square  whose 
area  is  equal  to  that  of  a  given  circle. 
Take the radius of the circle as one unit and place the center of the
circle at the point (0,1). Let x represent half the side of the sought square.
Then $4x^2 = \pi$, and $x = \sqrt{\pi}/2$. Suppose the construction is possible. Then,
starting with the two points (0,1) and (0,0), we can construct the point
$(\sqrt{\pi}/2, 0)$, and it follows that $\sqrt{\pi}/2$, and hence $\pi$, is in R. This contradicts
Theorem 9, which says that all numbers in R are algebraic, for it is known
that $\pi$ is nonalgebraic. The contradiction proves the theorem.

Note.  Theorem  15  disposes  of  the  ancient  problem  of  squaring  a  circle. 

A.4 A Constructive Proof of the Existence of Transcendental Numbers

Cantor’s  proof  of  the  existence  of  transcendental  numbers,  as  given  in 
Theorem  4  of  Section  8.4,  can  scarcely  be  called  a  constructive  proof;  it 
does  not  lead  to  a  number  whose  decimal  expression,  for  example,  can 
actually  be  written  down.  Many  mathematicians,  among  whom  are  the 
members  of  the  intuitionist  school  of  philosophy  of  mathematics,  do  not 
accept  nonconstructive  existence  proofs  but  demand  that  an  existence  proof 
should  produce  an  actual  example  of  one  of  the  entities  whose  existence  is 
being demonstrated. Such a proof of the existence of transcendental numbers
was  given  by  Joseph  Liouville  (1809-1882)  in  1851,  twenty-some  years  before 
Cantor  published  his  nonconstructive  demonstration.  We  give  here  a  modern 
and  simplified  version  of  Liouville’s  approach,  which,  though  it  employs  only 
high  school  mathematics,  will  require  careful  reading  and  close  attention. 
We  begin  by  recalling  once  more  the  definition  of  an  algebraic  number  and 
of  a  transcendental  number. 

Definition 1. A number $\alpha$ of the complex number system is said to be
an algebraic number if there exists a positive integer n and a set of integers
$a_0, a_1, \ldots, a_n$ ($a_0 \neq 0$) such that

\[ a_0\alpha^n + a_1\alpha^{n-1} + \cdots + a_{n-1}\alpha + a_n = 0. \]

If n is the smallest such positive integer, then $\alpha$ is said to be of degree n.

Definition  2.  A  number  of  the  complex  number  system  which  is  not 
algebraic  is  said  to  be  a  transcendental  number. 

Theorem 1 (Liouville’s). If $\alpha$ is an algebraic number of degree n > 1, there
exists a number c, 0 < c < 1, such that for all integers p and q, where q > 0,
we have

\[ |\alpha - p/q| > c/q^n. \qquad (1) \]

We are given that

\[ a_0\alpha^n + a_1\alpha^{n-1} + \cdots + a_{n-1}\alpha + a_n = 0, \]

where n > 1, the $a_i$’s are integers, and $a_0 \neq 0$. Let us set

\[ f(x) = a_0x^n + a_1x^{n-1} + \cdots + a_{n-1}x + a_n. \]

Then $f(\alpha) = 0$, and we have

\[
\begin{aligned}
-f(p/q) &= f(\alpha) - f(p/q) \\
&= a_0[\alpha^n - (p/q)^n] + a_1[\alpha^{n-1} - (p/q)^{n-1}] \\
&\qquad + \cdots + a_{n-2}[\alpha^2 - (p/q)^2] + a_{n-1}(\alpha - p/q) \\
&= (\alpha - p/q)\{a_0[\alpha^{n-1} + (p/q)\alpha^{n-2} + \cdots + (p/q)^{n-1}] \\
&\qquad + \cdots + a_{n-2}(\alpha + p/q) + a_{n-1}\}. \qquad (2)
\end{aligned}
\]

Since n > 1, $\alpha$ must be irrational, for a rational number is algebraic of
degree 1. Therefore $\alpha - p/q \neq 0$, and we must have $f(p/q) \neq 0$, inasmuch
as otherwise the second factor in the right member of (2) would have to
vanish, making $\alpha$ of degree less than n, which contradicts the hypothesis of
the theorem. Consequently, we have

\[ |f(p/q)| = |a_0p^n + a_1p^{n-1}q + \cdots + a_nq^n|/q^n \geq 1/q^n. \qquad (3) \]

But, from equation (2),

\[
\begin{aligned}
|f(p/q)| \leq |\alpha - p/q| \{ &|a_0| (|\alpha|^{n-1} + |p/q|\,|\alpha|^{n-2} + \cdots + |p/q|^{n-1}) \\
&+ \cdots + |a_{n-2}|(|\alpha| + |p/q|) + |a_{n-1}| \}. \qquad (4)
\end{aligned}
\]

Since (1) is obviously true if $|\alpha - p/q| \geq 1$, we may assume $|\alpha - p/q| < 1$.
Then $|p/q| < |\alpha| + 1$. Therefore, replacing both $|p/q|$ and $|\alpha|$ in the second
factor of the right member of (4) by $|\alpha| + 1$, we have

\[
|f(p/q)| < |\alpha - p/q| \{ n|a_0|(|\alpha| + 1)^{n-1} + \cdots + 2|a_{n-2}|(|\alpha| + 1) + |a_{n-1}| \} = |\alpha - p/q|\,K,
\]

say, where K is clearly a number greater than 1. Combining this result with
(3), we have

\[ |\alpha - p/q| > |f(p/q)|/K \geq c/q^n, \]

where c = 1/K, and the theorem is proved.
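
To see the constants of the proof in a concrete case, one may take $\alpha = \sqrt{2}$, a root of $x^2 - 2$. This choice of example is ours, not the text's. The sketch below computes K exactly as in the proof and then tests inequality (1) for the numerator p closest to $\alpha q$, for each q up to 1000. It is a floating-point illustration under these assumptions, not a proof.

```python
from math import sqrt

# Hypothetical worked instance: alpha = sqrt(2) is a root of f(x) = x^2 - 2,
# so n = 2 and (a_0, a_1, a_2) = (1, 0, -2).
coeffs = [1, 0, -2]
n = len(coeffs) - 1
alpha = sqrt(2)

# K = n|a_0|(|alpha|+1)^(n-1) + ... + 2|a_(n-2)|(|alpha|+1) + |a_(n-1)|
K = sum((n - i) * abs(coeffs[i]) * (abs(alpha) + 1) ** (n - 1 - i)
        for i in range(n))
c = 1 / K

def liouville_bound_holds(max_q):
    """Check |alpha - p/q| > c / q^n for the p nearest alpha*q, 1 <= q <= max_q."""
    for q in range(1, max_q + 1):
        p = round(alpha * q)      # nearest numerator: the hardest case for this q
        if not abs(alpha - p / q) > c / q ** n:
            return False
    return True

print(0 < c < 1)                     # True
print(liouville_bound_holds(1000))   # True
```

Here K = 2(√2 + 1), so c is roughly 0.207, and every rational p/q stays farther than c/q² from √2.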

Theorem 2. The real number

\[ t = \sum_{k=1}^{\infty} a_k/r^{k!}, \]

where r is an integer greater than 1 and the $a_k$’s are integers such that $0 < a_k < r$,
is a transcendental number.

Suppose t is algebraic. Then t is irrational (since its radix fraction for
base r is nonperiodic) and hence of degree n > 1. By Theorem 1, then,
there exists a real number c, 0 < c < 1, such that

\[ |t - p/q| > c/q^n \]

for every rational number p/q with q > 0. Let us choose for p/q the rational
number

\[ \sum_{k=1}^{m} a_k/r^{k!} = p/r^{m!}. \]

Then we have

\[
\begin{aligned}
c/r^{m!\,n} < |t - p/q| &= \sum_{k=1}^{\infty} a_k/r^{k!} - \sum_{k=1}^{m} a_k/r^{k!} \\
&= \sum_{k=m+1}^{\infty} a_k/r^{k!} < (1 + 1/r + 1/r^2 + \cdots)/r^{(m+1)!-1} \\
&= [r/(r-1)]/r^{(m+1)!-1} \leq r/r^{(m+1)!-1},
\end{aligned}
\]

or

\[ c < r^{m!\,n+1}/r^{(m+1)!-1} = 1/r^{m!(m+1-n)-2}, \]

which, since c > 0, is impossible for large enough m. Hence the theorem
by reductio ad absurdum.

Definition 3. Numbers like t of Theorem 2 are called Liouville numbers
to the base r. If we take r = 10, we obtain the Liouville numbers to base 10,

\[ t = \sum_{k=1}^{\infty} a_k/10^{k!} = 0.a_1a_2000a_300000000000000000a_4000\cdots, \]

where the $a_k$’s are arbitrary digits from 1 to 9.
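
Since the point of this section is that such a number can actually be written down, a short sketch may help. It takes every digit $a_k$ equal to 1 (our choice for illustration) and works with exact fractions. The helper names `liouville_partial_sum` and `decimal_digits` are hypothetical.

```python
from fractions import Fraction
from math import factorial

def liouville_partial_sum(m, digits=None, r=10):
    """Exact partial sum of a_k / r^(k!) for k = 1..m (all a_k = 1 by default)."""
    if digits is None:
        digits = [1] * m
    return sum(Fraction(a, r ** factorial(k))
               for k, a in zip(range(1, m + 1), digits))

def decimal_digits(x, places):
    """First `places` decimal digits of a fraction 0 <= x < 1, as a string."""
    return str(x.numerator * 10 ** places // x.denominator).zfill(places)

s3 = liouville_partial_sum(3)
s4 = liouville_partial_sum(4)

# Nonzero digits sit in decimal places 1! = 1, 2! = 2, 3! = 6 and 4! = 24.
print("0." + decimal_digits(s4, 24))
print(decimal_digits(s4, 24) == "110001" + "0" * 17 + "1")   # True

# The rational s3 has denominator q = 10^6, yet the next term is only q^(-4):
q3 = s3.denominator
print(q3 == 10 ** 6, s4 - s3 == Fraction(1, q3 ** 4))        # True True
```

The last line shows the mechanism behind Theorem 2: the partial sums approximate t far more closely, relative to their denominators, than Theorem 1 permits for any algebraic number of fixed degree.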
A.5 The Axiom of Choice

It will be recalled (see Problem 5.5.11 or Section 6.2) that a set S of
elements a, b, c, ... is said to be simply ordered with respect to a dyadic
relation “<” (which we may read “precedes”) if the following three postulates
are satisfied:

O1: If $a \neq b$, then either a < b or b < a.

O2: If a < b, then $a \neq b$.

O3: If a < b and b < c, then a < c.

The  set  S  is  said  to  be  well-ordered  with  respect  to  the  relation  “<”  if  the 
following  fourth  postulate  also  holds: 

O4: If T is any nonempty subset of S, then there exists an element a of T such
that if b is any other element of T, then a < b.

In  short,  a  set  S  is  well-ordered  with  respect  to  a  dyadic  relation  “  <” 
if  it  is  simply  ordered  with  respect  to  “<”  and  if  every  nonempty  subset 
of  S  has  a  first  element. 

An  obvious  example  of  a  well-ordered  set  is  the  set  of  all  natural 
numbers,  1,  2,  3,  •  •  •,  where  “  <”  means  “is  less  than”  in  the  ordinary  sense. 
For  a  more  recondite  example,  consider  the  natural  numbers,  1,  2,  3,  •  •  •, 
where  “<”  is  defined  as  follows:  (1)  if  a  is  odd  and  b  is  even,  then  a  <  b; 
(2)  if  a  and  b  are  both  odd  or  both  even,  then  a  <  b  denotes  the  natural 
order.  Schematically,  the  order  of  the  natural  numbers  in  this  second 
example  is 

1, 3, 5, 7, ..., 2, 4, 6, 8, ...

and  it  is  an  easy  matter  to  show  that  all  four  order  postulates  are  satisfied. 
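
The second ordering is easy to experiment with. The sketch below encodes it as a function `precedes` (a name of our own choosing), checks Postulates O1 to O3 on the sample 1, ..., 20, and finds first elements of a few finite subsets. A finite check of this kind only illustrates the postulates; it does not prove them for the whole infinite set.

```python
def precedes(a, b):
    """The relation '<' of the second example: odd numbers first, then even."""
    if a % 2 != b % 2:
        return a % 2 == 1         # the odd number precedes the even one
    return a < b                  # same parity: natural order

def first_element(T):
    """First element of a nonempty finite set T of natural numbers."""
    return next(a for a in T if all(precedes(a, b) for b in T if b != a))

sample = range(1, 21)
o1 = all(precedes(a, b) or precedes(b, a)
         for a in sample for b in sample if a != b)
o2 = all(a != b for a in sample for b in sample if precedes(a, b))
o3 = all(precedes(a, c)
         for a in sample for b in sample for c in sample
         if precedes(a, b) and precedes(b, c))

print(o1, o2, o3)                      # True True True
print(first_element({4, 10, 7, 9}))    # 7
print(first_element({8, 6, 12}))       # 6
```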

Now  there  is  a  surprising  theorem  concerning  well-ordering  which  was 
proved  by  E.  Zermelo  in  1904  and  which  may  be  stated  as  follows: 

Zermelo’s  Well-ordering  Theorem.  If  S  is  any  set  whatever ,  there  exists 
a  dyadic  relation  “<”  for  the  set  S  with  respect  to  which  S  is  well-ordered.* 

The  reaction  of  many  mathematicians  to  Zermelo’s  theorem  was  that 
there  must  be  something  wrong  with  the  proof;  the  theorem  seemed  too 
incredible to be accepted. Consider, for example, the set S of all real numbers
x such that $0 \le x \le 1$. If we take “<” to mean “is less than” in the ordinary
sense, then this set of real numbers is not well-ordered. To see this, we have
only to let T be the subset obtained from S by deleting the single number 0.
Since there is no first real number greater than 0, this subset T has no first
element, and Postulate O4 fails to hold. Now S is a nondenumerable set of
cardinal  number  c.  No  one  has  ever  been  able  to  effect  a  well-ordering  of 
such  a  set,  much  less  a  well-ordering  of  a  set  of  higher  cardinal  number. 
Consequently,  many  mathematicians  firmly  felt  that  no  definition  of  “<” 

*This  theorem  had  been  conjectured  by  Cantor  as  early  as  1883. 
can exist with respect to which such a set would be well-ordered. Not so,
however, according to Zermelo’s theorem. Even if we have a set S of
cardinal number higher than c, say the set of all single-valued functions f(x)
defined on the interval $0 \le x \le 1$, Zermelo’s theorem applies: there exists
a  definition  of  “<”  with  respect  to  which  the  set  is  well-ordered!  This,  if 
true,  is  very  remarkable. 

In  the  search  for  a  flaw  in  Zermelo’s  proof,  E.  Borel  discovered  that 
Zermelo  had  based  his  demonstration  on  a  seemingly  obvious  principle  which 
mathematicians  had  been  using  for  a  long  time,  but  which  no  one  had  ever 
considered  listing  explicitly  as  an  assumption.  This  principle  may  be  stated 
as  follows: 

Zermelo’s Postulate or The Axiom of Choice. If a set S is divided
into a collection of mutually disjoint nonempty subsets A, B, C, ..., there exists at
least one set R which has as its elements exactly one element from each of the subsets
A, B, C, ....

Since  the  set  R  can  evidently  be  formed  if  one  may  choose  an  element 
from  each  of  the  subsets  A,  B,  C,  •  •  •  of  S,  the  principle  asserts  that  such  a 
choice  may  be  made;  this  accounts  for  the  name,  “axiom  of  choice.” 

To  illustrate  the  meaning  of  the  principle,  let  S  be  the  set  of  all  people 
residing  in  the  United  States,  and  let  A,  B,  C,  •  •  •  be  the  subsets  of  these 
people  residing  in  the  various  fifty  states,  where  we  assume  that  each  resident 
of  the  United  States  lives  in  one  and  only  one  state.  Then  the  axiom  of 
choice  claims  that  there  exists  at  least  one  set  R  consisting  of  one  and  only 
one  resident  of  each  state. 

Borel  pointed  out  that  not  only  is  Zermelo’s  theorem  based  on  the 
axiom  of  choice  but  that  it  is  actually  equivalent  to  the  axiom  of  choice. 
For we can easily show that if we admit that every set possesses a well-ordering,
then the axiom of choice must follow. Thus, let S be any set
which is divided into a collection of mutually disjoint nonempty subsets A, B, C, ....
Granting Zermelo’s theorem, there exists a definition of “<” with respect
to which S is well-ordered. By Postulate O4, each of the subsets A, B, C, ...
of S possesses a first element. We now establish the axiom of choice by
choosing  for  R  the  set  of  all  these  first  elements.  Accepting  Zermelo’s  proof 
of  his  theorem  from  the  axiom  of  choice,  it  follows  that  the  theorem  and 
the  axiom  are  equivalent. 
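
The step from a well-ordering to a choice set can be imitated on a small scale. In the sketch below the partition `blocks` is a made-up example, and the well-ordering used is the odds-before-evens ordering of the natural numbers described earlier in this section. For finite collections no axiom is needed, so the code only illustrates the rule "take the first element of each subset."

```python
def order_key(a):
    """Sort key realizing the odds-before-evens ordering of the natural numbers."""
    return (a % 2 == 0, a)

def choice_set(blocks, key):
    """Form R by picking the first element of each block under the given ordering."""
    return {min(block, key=key) for block in blocks}

# A hypothetical partition of {1, ..., 12} into mutually disjoint nonempty subsets.
blocks = [{2, 4, 6}, {1, 8, 12}, {3, 5, 10}, {7, 9, 11}]
R = choice_set(blocks, order_key)

print(sorted(R))                                     # [1, 2, 3, 7]
print(all(len(R & block) == 1 for block in blocks))  # True
```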

The  publication  of  the  papers  by  Zermelo  and  Borel  calling  attention 
to  the  above  facts  touched  off  a  spirited  controversy  concerning  the  hitherto 
harmless  axiom  of  choice.  Sharp  differences  of  opinion  on  the  matter  were 
expressed  by  many  eminent  mathematicians.  Subsequent  discovery  of  further 
consequences  of  the  axiom  of  choice  has  not  altered  the  situation,  and  today 
one  finds  attitudes  toward  the  axiom  of  choice  ranging  from  complete 
acceptance,  through  various  degrees  of  skepticism,  to  total  rejection.  Modern 
researchers  in  topology  apparently  assume  the  axiom  with  no  hesitation, 
for there seems little evidence that any significant portion of topology can
be  derived  without  its  use.  In  algebra,  the  matter  is  quite  different.  Though 
there  are  some  operations  in  algebra  which  are  quite  awkward  without  the 
axiom  of  choice,  so  much  can  be  accomplished  without  it  that  algebraists 
are  inclined  to  proceed  as  far  as  possible  without  its  use.  A  great  deal  of 
analysis  can  be  established  without  the  use  of  the  axiom  of  choice,  but 
when  one  reaches  measure  theory  and  those  portions  of  modern  analysis 
which  rely  extensively  on  topological  ideas,  its  evasion  becomes  difficult  if 
not  impossible.  Today,  when  anyone  publishes  a  proof  in  which  the  axiom 
of  choice  is  used,  it  is  customary  to  call  attention  to  the  fact.  There  are  a 
number  of  famous  theorems  of  mathematics,  particularly  in  analysis  and  in 
set  theory,  for  which  proofs  avoiding  the  use  of  the  axiom  of  choice  are 
zealously  sought. 

Much  of  the  objection  to  the  axiom  of  choice  lies  in  one’s  conception 
of  mathematical  existence.  The  axiom  asserts  the  existence  of  a  certain  set  R 
but  says  nothing  about  ways  of  finding  R,  or  even  that  it  is  possible  to  find  it; 
the  assertion  is  merely  that  set  R  exists.  Now  some  mathematicians,  among 
whom are the members of the intuitionist school of philosophy of mathematics,
deny the mathematical existence of a set if there is no known way
of  ascertaining  the  members  of  the  set.  From  this  point  of  view,  a  set  is  a 
fit  subject  for  mathematical  investigation  only  if  a  rule  of  construction  is 
given  which  yields  the  members  of  the  set,  and  any  statement  dealing  with 
a  set  whose  claim  to  existence  is  not  validated  by  this  uncompromising 
criterion  of  construction  is  not  acceptable  as  a  mathematical  statement. 

It  should  now  be  clear  why  it  is  desirable  to  label  those  theorems  whose 
proofs  employ  the  axiom  of  choice.  For  some  mathematicians  such  theorems 
have  not  been  established  at  all,  while  even  for  those  who  accept  the  proofs, 
the  question  remains  as  to  whether  the  theorems  can  perhaps  be  proved 
without  the  use  of  the  axiom  of  choice  and  thus  be  made  acceptable  to  their 
more  particular  colleagues  of  the  other  school. 

In  order  further  to  clarify  some  of  the  above  remarks,  consider  the 
following  often  quoted  example,  the  original  version  of  which  was  given  by 
Bertrand  Russell.  In  an  infinite  collection  of  pairs  of  shoes,  no  axiom  of 
choice  is  needed  to  establish  the  existence  of  a  set  of  shoes  containing  exactly 
one  shoe  from  each  pair;  such  a  set  may  be  effectively  constructed  by  the 
simple  rule  of  selecting  all  right  shoes  from  the  collection.  But  suppose  we 
have  an  infinite  collection  of  pairs  of  socks,  all  socks  alike  as  to  size,  color, 
etc.,  and  thus  completely  indistinguishable  from  one  another.  Does  there 
exist  a  set  containing  exactly  one  sock  from  each  pair?  If  we  are  to  accept 
such  a  set,  we  must  do  so  by  the  axiom  of  choice,  for  now  there  apparently 
is  no  possible  rule  by  which  the  desired  set  may  be  constructed. 

As  a  more  mathematical  example,  consider  the  following.  Let  S  be  the 
set of all real numbers x such that $0 \le x \le 1$. Decompose S into subsets by
putting into the same subset all members of S which differ from one another
by  a  rational  number.  We  then  have  a  collection  of  subsets  of  S  which  are 
mutually  disjoint  and  nonempty.  Without  the  aid  of  the  axiom  of  choice 
there  is  no  way  of  demonstrating  the  existence  of  a  set  R  containing  exactly 
one  number  from  each  of  these  subsets.  Such  a  set  R  exists  only  by  special 
sanction. 

We  thus  see  that  if  we  can  constructively  define  the  elements  of  set  R 
in  a  perfectly  definite  way,  we  can  avoid  the  axiom  of  choice;  if  we  cannot 
so  define  the  elements  of  R ,  we  must  have  recourse  to  the  axiom  of  choice. 

Even  one  whose  conception  of  mathematical  existence  does  not  seem 
to  clash  with  the  axiom  of  choice  can  easily  develop  considerable  skepticism 
with  regard  to  the  axiom,  for  the  axiom  is  remarkably  far-reaching,  and 
with  its  aid  some  very  startling  theorems  can  be  deduced.  As  an  example,  let 
us  cite  an  astounding  result  obtained  in  1924  by  S.  Banach  and  A.  Tarski. 

The  Banach-Tarski  Paradox.  In  any  Euclidean  space  of  dimension  n  >  2, 
any  two  arbitrary  bounded  sets  containing  interior  points  are  equivalent  by  finite 
decomposition. 

Without  attempting  to  explain  the  meanings  of  all  the  technical  terms 
in  this  statement,  we  shall  merely  illustrate  by  an  example.  Consider  two 
solid  spheres,  P  and  S ,  where  P  is  the  size  of  a  pea  and  S  is  the  size  of  the 
sun.  Then,  according  to  the  statement,  it  is  possible  to  decompose  the  set  of 
points  making  up  P  into  a  finite  number  of  mutually  disjoint  subsets  such 
that,  by  ordinary  rigid  motions  (translations  and  rotations),  these  sets  may 
be  made  to  fill  out  the  entire  sphere  S.  In  the  notation  of  set  theory  this 
means that we have subsets $P_1, \ldots, P_n$ of P and subsets $S_1, \ldots, S_n$ of S
such that

\[ P = P_1 \cup P_2 \cup \cdots \cup P_n, \qquad P_i \cap P_j = \emptyset \ \text{for}\ i \neq j, \]

\[ S = S_1 \cup S_2 \cup \cdots \cup S_n, \qquad S_i \cap S_j = \emptyset \ \text{for}\ i \neq j, \]

and $P_i$ is congruent to $S_i$ for each i. Surely such a state of affairs must appear
astonishing  even  to  the  most  sophisticated  mathematician. 

For  many  years  a  great  open  problem  of  set  theory  has  concerned  the 
possibility  of  deriving  the  axiom  of  choice  from  the  postulates  of  certain 
existing  postulate  sets  of  set  theory.  Though  the  derivation  of  the  axiom 
is easily accomplished for the case where the set S (in our statement of
the axiom of choice) is divided into a finite collection of mutually disjoint
nonempty subsets, no one as yet has been able to derive the axiom
for the case where S is divided into an infinite collection of such subsets,
and the feeling is that this latter task is impossible. Kurt Gödel
proved, however, about 1940, that if the concerned postulate sets for set
theory are consistent, then the addition of the axiom of choice cannot
lead to any contradictions.
A.6 The Eudoxian Resolution of the First Crisis in the Foundations of Mathematics*

In  Book  V  of  Euclid’s  Elements  appears  a  masterly  exposition  of  Eudoxus’ 
theory  of  proportion.  It  was  this  theory,  applicable  to  incommensurable  as 
well  as  to  commensurable  magnitudes,  that  resolved  the  “logical  scandal” 
created  by  the  Pythagorean  discovery  of  irrational  numbers.  The  remarkable 
Eudoxian  definition  of  proportion,  or  equality  of  two  ratios,  is  cited  at  the 
start of Book V, and runs as follows: Magnitudes are said to be in the same ratio,
the first to the second and the third to the fourth, when, if any equimultiples whatever
be taken of the first and third, and any equimultiples whatever of the second and fourth,
the former equimultiples alike exceed, are alike equal to, or are alike less than the latter
equimultiples taken in corresponding order. In other words, if A, B, C, D are any
four unsigned magnitudes, A and B being of the same kind (both line segments,
or angles, or areas, or volumes) and C and D being of the same kind,
then the ratio of A to B is equal to that of C to D when, for arbitrary positive
integers m and n, $mC \gtreqless nD$ according as $mA \gtreqless nB$.

Let  us  indicate  the  differences  between  the  Pythagorean,  the  Eudoxian, 
and the modern textbook proofs of a simple proposition involving proportions.
We select Proposition VI 1: The areas of triangles having the same altitude
are  to  one  another  as  their  bases.  We  shall  permit  ourselves  to  use  Proposition  I 
38,  which  says  that  triangles  having  equal  bases  and  equal  altitudes  have  equal  areas, 
and  a  consequence  of  I  38  to  the  effect  that  of  any  two  triangles  having  the 
same  altitude,  that  one  has  the  greater  area  which  has  the  greater  base. 


FIG. 52


Let  the  triangles  be  ABC  and  ADE,  the  bases  BC  and  DE  lying  on 
the  same  straight  line  MN,  as  in  Figure  52.  Now  the  Pythagoreans,  before 
the discovery of irrational numbers, tacitly assumed that any two line segments
are commensurable. Thus BC and DE were assumed to have some

*  Adapted  from  Howard  Eves,  An  Introduction  to  the  History  of  Mathematics,  revised 
edition,  Holt,  Rinehart  and  Winston,  1964. 
common unit of measure, going, say, p times into BC and q times into DE.
Mark off these points of division on BC and DE and connect them with
vertex A. Then triangles ABC and ADE are divided, respectively, into p and q
smaller triangles, all having, by I 38, the same area. It follows that
$\triangle ABC : \triangle ADE = p : q = BC : DE$, and the proposition is established. With the later
discovery  that  two  line  segments  need  not  be  commensurable,  this  proof, 
along  with  many  others,  became  inadequate,  and  the  very  disturbing 
“logical  scandal”  came  into  existence. 

Eudoxus’  theory  of  proportion  cleverly  resolved  the  “scandal,”  as  we 
shall  now  illustrate  by  reproving  VI  1  in  the  manner  found  in  Book  VI  of 
the Elements. On CB produced mark off, successively from B, m - 1 segments
equal to CB, and connect the points of division, $B_2, B_3, \ldots, B_m$, with vertex
A, as shown in Figure 53. Similarly, on DE produced mark off, successively
from E, n - 1 segments equal to DE, and connect the points of division,
$E_2, E_3, \ldots, E_n$, with vertex A. Then $B_mC = m(BC)$, $\triangle AB_mC = m(\triangle ABC)$,
$DE_n = n(DE)$, $\triangle ADE_n = n(\triangle ADE)$. Also, by I 38 and its corollary,
$\triangle AB_mC \gtreqless \triangle ADE_n$ according as $B_mC \gtreqless DE_n$. That is,
$m(\triangle ABC) \gtreqless n(\triangle ADE)$ according as $m(BC) \gtreqless n(DE)$, whence, by the
Eudoxian definition of proportion, $\triangle ABC : \triangle ADE = BC : DE$, and the
proposition is established. No mention was made of commensurable and
incommensurable quantities since the Eudoxian definition applies equally to
both situations.

FIG. 53
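
The Eudoxian criterion lends itself to a direct, if necessarily finite, test. The sketch below compares the signs of mA - nB and mC - nD for all multipliers up to a chosen limit. The bases, the altitude, and the function names are arbitrary choices of ours. Because exact fractions are used, the magnitudes in this example are commensurable, and a finite search can refute a proportion but can only lend support to one.

```python
from fractions import Fraction

def sign(x):
    """Return 1, 0 or -1 according as x is positive, zero or negative."""
    return (x > 0) - (x < 0)

def same_ratio(A, B, C, D, limit=30):
    """Test the Eudoxian criterion for all multipliers 1 <= m, n <= limit.

    A : B = C : D requires mA >, =, < nB exactly when mC >, =, < nD.
    """
    return all(sign(m * A - n * B) == sign(m * C - n * D)
               for m in range(1, limit + 1) for n in range(1, limit + 1))

def triangle_area(base, altitude):
    return Fraction(base) * Fraction(altitude) / 2

h = Fraction(7, 3)                       # common altitude (arbitrary choice)
BC, DE = Fraction(5, 2), Fraction(4)     # bases (arbitrary choices)

# Same altitude: the areas are to one another as the bases (Proposition VI 1).
print(same_ratio(triangle_area(BC, h), triangle_area(DE, h), BC, DE))      # True

# Doubling one altitude breaks the proportion; m = 2, n = 1 already shows it.
print(same_ratio(triangle_area(BC, h), triangle_area(DE, 2 * h), BC, DE))  # False
```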

Many present-day high school textbooks advocate a proof of this
theorem involving two cases, according as BC and DE are or are not commensurable.
The commensurable case is handled as in the Pythagorean
solution above, and simple limit notions are used to deal with the incommensurable
case. Thus, suppose BC and DE are incommensurable. Divide
BC into n equal parts, BR being one of the parts (see Figure 54). On DE
mark off a succession of segments equal to BR, finally arriving at a point F
on DE such that FE < BR. By the commensurable case, already established,
$\triangle ABC : \triangle ADF = BC : DF$. Now let $n \to \infty$. Then $DF \to DE$ and
$\triangle ADF \to \triangle ADE$. Hence, in the limit, $\triangle ABC : \triangle ADE = BC : DE$. This approach uses
the fact that any irrational number may be regarded as the limit of a sequence
of rational numbers, an approach that was rigorously developed in modern
times by Georg Cantor (1845-1918).

FIG. 54

A.7 A Note on Gödel’s Incompleteness Theorem

In 1931 there appeared, in the journal Monatshefte für Mathematik und
Physik, a paper entitled “Über formal unentscheidbare Sätze der Principia
Mathematica und verwandter Systeme” (“On formally undecidable propositions
of Principia Mathematica and related systems”). The author of the
paper was a 25-year-old Austrian mathematician and logician named Kurt
Gödel, who was, at the time, at the University of Vienna. When the paper
appeared,  it  received  only  scattered  and  scant  attention,  for  it  concerned 
itself  with  a  highly  specialized  area  of  study  that  had  not  yet  attracted  many 
researchers,  and  it  used  a  type  of  proof  that  was  so  technically  novel  as  to 
be  incomprehensible  to  most  readers.  Within  a  very  few  years,  however,  the 
paper  became  widely  recognized  professionally  as  one  of  the  truly  epoch- 
making  contributions  to  the  foundations  of  mathematics  and  logic.  It  led, 
in  1938,  to  the  appointment  of  the  author  as  a  permanent  member  of  the 
Institute  for  Advanced  Study  at  Princeton  and,  in  1952,  to  the  rather  rare 
event  of  a  mathematician  receiving  an  honorary  degree  from  a  top-flight 
American  university— Harvard  University. 

Gödel’s paper revealed certain unforeseen limitations in axiomatic procedure.
In particular, it accomplished the following: (1) It upset a strong
belief that all the important areas of mathematics can be completely axiomatized.
(2) It annihilated all hopes of establishing the inner consistency of
mathematics along the lines that had been envisioned by David Hilbert.
(3) It led to a reappraisal, not yet completed, of certain widely-held philosophies
of mathematics. (4) It introduced into foundational studies a new,
powerful, and fertile technique of analysis which has suggested and initiated
many new avenues of investigation.

It  is  the  purpose  of  this  note  to  comment  on  the  above  four  points. 
Because  of  the  advanced  nature  of  the  material  and  methods  involved,  the 
discussion  will  be  brief  and  will  skirt  the  formidable  technicalities  of  the 
subject. 

1.  Following  the  development  of  the  concept  of  formal  axiomatics  by 
David  Hilbert  and  others  in  the  early  part  of  the  twentieth  century,  the 
axiomatic  method  was  vigorously  exploited.  A  number  of  new  and  old 
branches  of  mathematics  were  supplied  with  sets  of  axioms  which  were 
thought  to  be  adequate  for  the  settlement  of  any  question  involving  the 
primitive  terms  of  those  branches  of  mathematics;  that  is,  the  branches  of 
mathematics  were  supplied  with  what  were  thought  to  be  complete  sets  of 
axioms.  For  example,  it  was  thought  that  the  Peano  axiom  set  for  the  natural 
number  system  was  complete,  or,  if  not  complete,  that  it  could  surely  be 
made so by the addition of one or more further axioms. This belief was
shattered by Gödel’s paper, for in that paper Gödel proved the following
theorem: 

Gödel’s First Theorem. For any consistent formal system L which contains
the natural number system (such as that of Principia Mathematica, for example),
there  are  undecidable  propositions  in  L;  that  is,  there  are  propositions  F  in  L  such 
that  neither  F  nor  not- F  is  provable  in  L. 

It  follows  that  any  postulate  set  for  the  natural  number  system  must, 
if  consistent,  be  incomplete.  In  other  words,  no  matter  what  consistent 
postulate  set  one  might  adopt  for  the  natural  number  system,  there  will  be 
statements  F  about  the  natural  numbers  such  that  neither  F  nor  not-F  is 
provable  from  the  postulates. 

There  are  many  famous  conjectures  in  number  theory  that,  in  spite  of 
extended  and  strenuous  effort,  have  never  been  established  nor  disestablished. 
Among  these  are,  for  example,  the  Goldbach  conjecture  (that  every  even 
integer  greater  than  2  can  be  expressed  as  the  sum  of  two  primes)  and  the 
conjecture  known  as  Fermat’s  last  theorem  (that  there  do  not  exist  positive 
integers  x,y,  z  such  that  xn  +  yn  =  zn  when  n  is  a  positive  integer  greater  than 
2).  It  could  be  that  the  statements  about  natural  numbers  involved  in  these 
conjectures  are  among  those  propositions  which  are  undecidable  in,  say, 
the  Peano  development  of  the  natural  number  system,  and  this  could  be 
the  reason  no  one  has  been  able  to  establish  or  disestablish  the  conjectures. 
It  would  surely  be  some  gain  if  of  a  given  statement  about  the  natural 
numbers one could at least determine whether or not it is a provable proposition
of the system. But here, too, the outlook is melancholy, for in 1936
Alonzo  Church*  proved  the  following  theorem: 


*See  A.  Church  [7]. 
Church’s Theorem. For any consistent formal system L containing the natural
number system, there exists no effective method of deciding which propositions of L
are provable in L.

2.  We  have  seen,  in  our  discussion  of  the  formalist  school  of  philosophy 
of  mathematics  in  Chapter  9,  that  David  Hilbert  attempted  to  establish  the 
inner  consistency  of  classical  mathematics.  He  had  hoped  to  accomplish  this 
by  showing  that  the  permitted  rules  of  logical  procedure  (which  are,  of 
course,  part  and  parcel  of  the  formal  development)  could  never  lead  one  to 
deduce  two  contradictory  propositions  F  and  not-F.  Though  Hilbert  was 
able  to  illustrate  with  certain  simple  systems  what  he  hoped  to  do  for  the 
more  complicated  system  of  all  classical  mathematics,  he  was  unable  to  carry 
out  his  consistency  proof  for  the  latter  system.  This  was  shown  by  Godel  to 
be  inevitable,  for  in  his  famous  paper  he  also  managed  to  prove  the  following 
theorem: 

Godel’s Second Theorem. For any consistent formal system L which contains
the natural number system, the consistency of L cannot be proved in L.

It follows that among the undecidable problems in L, that of the consistency
of L is one. This shattered Hilbert’s hopes as originally envisioned,
and it now seems that the internal consistency of classical mathematics cannot
be attained unless one adopts principles of reasoning of such complexity
that the internal consistency of these principles is as open to doubt as that
of classical mathematics itself.

3.  Godel’s  two  theorems  have  shown  that  no  complete  axiomatic 
development  of  certain  important  sectors  of  mathematics  is  attainable,  and 
that  no  truly  impeccable  guarantee  can  be  given  that  certain  important 
sectors  of  mathematics  are  free  of  internal  contradiction.  These  are  severe 
limitations  of  the  axiomatic  method,  and  they  point  up  that  the  processes 
of  mathematical  proof  may  not,  indeed  probably  do  not,  coincide  with  formal 
axiomatic  procedures.  No  prior  limits  can  be  imposed  on  the  inventiveness 
of  mathematicians  in  devising  new  procedures  of  proof.  Apparently  human 
intellectual  resources  cannot  be  fully  formalized,  and  new  principles  of  proof 
await discovery and invention. All this has obvious repercussions in any
discussion  of  a  philosophy  of  mathematics,  and  shows  that  some  presently 
widely-held  philosophies  of  mathematics  must  be  revamped  or  scrapped. 

4. Though the proofs of Godel’s theorems (as supplied by Godel himself,
or later by others, like Barkley Rosser and S. G. Kleene) are too technical
to be given here, something should be said of Godel’s remarkable device for
“arithmetizing” his treatment. The device has proven to be very powerful
and surprisingly applicable, and it somewhat compares, in its area of applicability,
with the fertile method of Descartes in geometry.
Godel’s device lies in a method of numbering his primitive symbols,
his formulas, and his proofs. In any completely formalized system L, there
are, first of all, basic symbols in terms of which propositions and proofs are
expressed. For instance, in Godel’s system L, appear (among others) the
symbols

f, ~, (, ),

and  a  denumerably  infinite  set  of  variables  in  each  of  a  denumerably  infinite 
set  of  types.  Godel  chose  to  assign  the  numbers  3,  5,  11,  and  13,  respectively, 
to  the  above  symbols,  and 

17^n, 19^n, 23^n, 29^n, ...

to the variables of type n, where 17, 19, 23, 29, ... is the sequence of consecutive
primes after the prime 13. We shall call these numbers, assigned to
the basic symbols, Godel numbers.

Now  a  proposition  in  L,  expressed  in  the  symbolism  of  L,  is  nothing 
but  a  finite  ordered  sequence  of  basic  symbols  of  L.  Such  sequences  are  called 
formulas, and a unique number is assigned to each formula as follows: Let
n(1), n(2), ..., n(s) be the Godel numbers of the symbols of a formula F in
the order in which they occur in F, and let p_1, p_2, ..., p_s be the first s
consecutive primes, starting with p_1 = 2. Then the number assigned to the
formula F, which we will call the Godel number of F, is the product

p_1^{n(1)} p_2^{n(2)} ... p_s^{n(s)}.

For  example,  consider  the  formula  (or  part  of  a  formula) 

~(x(fy)),

where  x  and  y  are  variables  of  types  2  and  1,  respectively,  possessing  the 
Godel numbers 17^2 and 17. Then the Godel numbers of the consecutive
symbols  of  the  formula  are  5,  11,  289,  11,  3,  17,  13,  13,  whence  the  Godel 
number  of  the  formula  itself  is  the  product 

2^5 · 3^11 · 5^289 · 7^11 · 11^3 · 13^17 · 17^13 · 19^13.

It  follows  that  for  every  proposition,  or  formula,  there  is  a  unique  Godel 
number.  Also,  given  the  Godel  number  of  a  formula,  the  formula  itself  can 
be  regained.  All  one  has  to  do  is  to  factor  the  number  of  the  formula  into 
its unique product of prime factors. Then the number of 2’s occurring in
the factorization is the Godel number of the first symbol of the formula,
the number of 3’s occurring in the factorization is the Godel number of the
second symbol of the formula, the number of 5’s occurring in the factorization
is the Godel number of the third symbol in the formula, and so on.
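
The encoding and decoding of a formula described above can be sketched in Python. This is only an illustration under stated assumptions: the names primes, godel_number, and decode are hypothetical, the table SYMBOLS holds just the four symbol numbers quoted in the text, and a variable of type n is entered directly as a power such as 17^n.

```python
def primes():
    """Yield 2, 3, 5, 7, ... by trial division (adequate for short formulas)."""
    found = []
    candidate = 2
    while True:
        if all(candidate % p for p in found):
            found.append(candidate)
            yield candidate
        candidate += 1

# Symbol numbers quoted in the text: f -> 3, ~ -> 5, ( -> 11, ) -> 13.
SYMBOLS = {"f": 3, "~": 5, "(": 11, ")": 13}

def godel_number(symbol_numbers):
    """Multiply p_i ** n(i) over the symbol numbers n(1), ..., n(s)."""
    result = 1
    for p, n in zip(primes(), symbol_numbers):
        result *= p ** n
    return result

def decode(number):
    """Recover the symbol numbers by counting how often each prime divides."""
    exponents = []
    for p in primes():
        if number == 1:
            break
        count = 0
        while number % p == 0:
            number //= p
            count += 1
        exponents.append(count)
    return exponents

# The example of the text: ~ ( x ( f y ) ), with x of type 2 and y of type 1.
example = [SYMBOLS["~"], SYMBOLS["("], 17 ** 2, SYMBOLS["("],
           SYMBOLS["f"], 17, SYMBOLS[")"], SYMBOLS[")"]]
number = godel_number(example)
recovered = decode(number)
```

Python integers are unbounded, so the factor 5^289 causes no overflow. For long formulas the numbers become far too large to be practical; the device is a theoretical one.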

Finally,  a  proof  is  merely  a  finite  ordered  sequence  of  formulas,  and  a 
device  can  be  used  to  assign  a  Godel  number  to  a  proof  similar  to  that  used 
to assign a Godel number to a formula. Suppose, for example, that a proof
is made up of the ordered sequence of formulas F_1, F_2, ..., F_t. Let f(i) be
the Godel number of formula F_i. Then the Godel number of the proof will
be the product

p_1^{f(1)} p_2^{f(2)} ... p_t^{f(t)},

where p_1, p_2, ..., p_t is the sequence of the first t consecutive primes, starting
with p_1 = 2. As before, given the Godel number of a proof, the steps, or
consecutive formulas, of the proof can be regained by factorization.

A  very  interesting  and  important  feature  of  Godel’s  device  is  that  it 
permits  metamathematical  statements— that  is,  statements  about  the  formal 
system  L— to  be  “translated”  into  statements  about  numbers.  For  when 
Godel  numbers  are  assigned  to  the  formulas  of  L,  then  a  statement  A  about 
these  formulas  can  be  replaced  by  a  statement  B  about  the  associated  Godel 
numbers  of  the  formulas,  such  that  statement  B  is  true  if  and  only  if  statement 
A is true. For example, suppose statement A says, “Formula F_1 consists of
formula F_2 with some more symbols added at the end.” For statement B
we may then take, “The Godel number of F_2 is a proper factor of the Godel number
of F_1, and the quotient is divisible by no prime that divides the Godel number of F_2.”
Now it happens that an important class of these B statements can
themselves  be  expressed  as  formulas  in  the  system  L,  with  their  own  Godel 
numbers,  and  the  formal  system  L  can,  to  this  extent,  “talk  about  itself.” 
It  is  this  ability  to  arithmetize  the  metamathematics  of  the  formal  system  L 
that  permitted  Godel  to  prove  his  two  remarkable  theorems,  which  are,  of 
course, metamathematical statements about his system L.

Before  closing  this  appendix,  we  will  give  a  simple  application  of  the 
Godel  numbering  device.  Let  us  use  the  device  to  give  another  proof  of  the 
fact  that  the  set  of  all  rational  numbers  is  denumerable.  To  this  end,  we  first 
note that any rational number can be uniquely written in the form (-1)^n p/q,
where n, p, q are nonnegative integers chosen to be as small as possible and
q ≠ 0. As examples, we have

-18/60 = (-1)^1 3/10 and 8/12 = (-1)^0 2/3.

Now  set  up  the  correspondence 

(-1)^n p/q ↔ 2^n 3^p 5^q,

and call the latter number the Godel index of the rational number (-1)^n p/q.
As an example, the Godel index of -3/4 is 2^1 3^3 5^4 = 33,750. There is clearly
a  one-to-one  correspondence  between  the  rational  numbers  and  their  Godel 
indices.  The  enumeration  of  the  rational  numbers  is  now  obtained  by  listing 
the  rational  numbers  in  the  order  of  magnitude  of  their  Godel  indices. 
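
The enumeration just described can be tried out with a short Python sketch. The function names godel_index, rational_from_index, and enumerate_rationals are illustrative assumptions, not part of the text. The search simply walks through the positive integers and keeps those that are the Godel index of some rational number in the canonical form above.

```python
from fractions import Fraction

def godel_index(r):
    """Return 2^n 3^p 5^q for r = (-1)^n p/q in lowest terms."""
    r = Fraction(r)
    n = 1 if r < 0 else 0
    p, q = abs(r.numerator), r.denominator
    return 2 ** n * 3 ** p * 5 ** q

def rational_from_index(index):
    """Return the rational whose Godel index is `index`, or None if there is none."""
    original = index
    exponents = []
    for prime in (2, 3, 5):
        count = 0
        while index % prime == 0:
            index //= prime
            count += 1
        exponents.append(count)
    n, p, q = exponents
    if index != 1 or n > 1 or q == 0:
        return None  # other prime factors, or exponents that cannot occur
    r = Fraction((-1) ** n * p, q)
    # Reject non-canonical triples (not in lowest terms, or "negative zero").
    return r if godel_index(r) == original else None

def enumerate_rationals(count):
    """List the first `count` rationals in order of increasing Godel index."""
    found, index = [], 1
    while len(found) < count:
        r = rational_from_index(index)
        if r is not None:
            found.append(r)
        index += 1
    return found
```

Under these assumptions, enumerate_rationals(5) lists the rationals with Godel indices 5, 15, 30, 45, and 75, namely 0, 1, -1, 2, and 1/2.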
SUGGESTIONS FOR THE SOLUTION OF
SOME OF THE PROBLEMS

1.1.3 Let the quadrilateral be ABCD, with AB = a, BC = b, CD = c, DA = d.
Show that 2K = ad sin A + bc sin C ≤ ad + bc; similarly, 2K ≤ ab + cd; therefore
4K ≤ ad + bc + ab + cd = (a + c)(b + d). This formula, like the ancient Babylonian
formula for the volume of a frustum of a cone or square pyramid, illustrates the prevalent
idea in early empirical geometry of averaging.

1.1.7 Show that angle AOF = 10° 00' 53''. An angle of 30° cannot be trisected
exactly with straightedge and compasses.

1.1.9  Use  the  rule  of  false  position  by  taking  BC  to  be  any  convenient  length,  say 
1  unit,  and  on  this  basis  calculate,  by  the  law  of  sines,  BA  and  BD,  and  then,  by  the 
law of cosines, AD; etc. This is a highly practicable procedure for solving this problem
and  is  actually  the  method  a  surveyor  would  probably  use. 

1.1.11  By  fraction ,  Ahmes  means  a  fraction  having  unit  numerator.  Only  the 
denominators  of  these  unit  fractions  were  written. 

1.1.13 Take π = 3.

1.1.14  (a)  Set  *  =  2 \y.  (b)  Eliminate  x  and  y,  obtaining  a  cubic  equation  in  z. 

1.2.2  It  is  fallacious  to  assume  that  a  high  degree  of  correlation  indicates  a  causal 
relationship.  The  high  correlation  may  indicate  some  common  cause,  or  it  may  be 
purely  accidental. 

1.2.4  Instead  of  isogonal  conjugate  lines  of  a  plane  angle,  consider  isogonal  conjugate 
planes  of  a  dihedral  angle. 

1.2.5  Use  diagrams. 

1.2.10  Examples  can  be  constructed  for  every  case  except  the  second. 
1.3.2 Use the fact that (√a − √b)^2 ≥ 0.

1.3.4  Construct  a  second  triangle  which  is  a  right  triangle  and  has  the  same  legs. 

1.3.6  (b)  ab  is  the  fourth  proportional  to  1,  a,  b.  (c)  a/b  is  the  fourth  proportional 
to b, 1, a. (d) √a is a mean proportional between 1 and a.

1.3.7  A  convex  polyhedral  angle  must  contain  at  least  three  faces,  and  the  sum  of 
its  face  angles  must  be  less  than  360°. 

1.4.5  Let  the  first  player  place  his  first  cigar  so  that  its  center  is  directly  over  the 
center  of  the  table,  and  let  him  thenceforth  place  his  cigars  in  the  positions  centrally 
symmetrical  to  those  occupied  by  the  second  player’s  cigars. 

1.5.2 Follow the proof, given in the text, of the irrationality of √2.

1.5.3  See,  e.g.,  H.  Eves,  Section  3-5. 

1.5.5 Show, from a figure, that ab = 2rs and a + b = r + s, and then solve simultaneously.
This problem, as well as the previous one, illustrates the arithmetical
nature of Heron’s geometry, which is more akin to oriental than to typical Greek
geometry.

1.5.6  (a)  Let  a  ray  of  light  emanating  from  a  point  A  hit  the  mirror  at  point  M 
and reflect toward a point B. If B' is the image of B in the mirror, then BB' is perpendicularly
bisected by the plane of the mirror, and we must have AMB' a straight
line. (b) Apply part (a).


2.1.4 (c) Suppose a > b. Then the algorism may be summarized as follows:

a = q_1 b + r_1,    0 < r_1 < b,
b = q_2 r_1 + r_2,    0 < r_2 < r_1,
r_1 = q_3 r_2 + r_3,    0 < r_3 < r_2,
. . .
r_{n-2} = q_n r_{n-1} + r_n,    0 < r_n < r_{n-1},
r_{n-1} = q_{n+1} r_n.

Now, from the last step, r_n divides r_{n-1}. From the next to the last step, r_n divides
r_{n-2}, since it divides both terms on the right. Similarly, r_n divides r_{n-3}. Successively,
r_n divides each r_i, and finally a and b.

On the other hand, from the first step, any common divisor c of a and b divides
r_1. From the second step, c then divides r_2. Successively, c divides each r_i. Thus c
divides r_n.

(d) From the next to the last step in the algorism we can express r_n in terms of
r_{n-1} and r_{n-2}. From the preceding step we can then express r_n in terms of r_{n-2} and
r_{n-3}. Continuing this way, we finally obtain r_n in terms of a and b.
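
The chain of divisions in part (c) and the back-substitution of part (d) can be followed numerically with a small Python sketch. The function names division_chain and extended_gcd are illustrative assumptions. The second function does the bookkeeping of part (d) in a forward direction instead of substituting backward, a common variant that yields the same kind of expression for the last nonzero remainder in terms of a and b.

```python
def division_chain(a, b):
    """Return the steps (dividend, quotient, divisor, remainder) of the algorism."""
    steps = []
    while b != 0:
        q, r = divmod(a, b)
        steps.append((a, q, b, r))
        a, b = b, r
    return steps

def extended_gcd(a, b):
    """Return (g, s, t) with g the last nonzero remainder and g = s*a + t*b."""
    old_r, r = a, b
    old_s, s = 1, 0
    old_t, t = 0, 1
    while r != 0:
        q = old_r // r
        # Each remainder stays expressed as (coefficient)*a + (coefficient)*b.
        old_r, r = r, old_r - q * r
        old_s, s = s, old_s - q * s
        old_t, t = t, old_t - q * t
    return old_r, old_s, old_t

steps = division_chain(1071, 462)
g, s, t = extended_gcd(1071, 462)
```

For a = 1071 and b = 462 the chain is 1071 = 2·462 + 147, 462 = 3·147 + 21, 147 = 7·21, and the sketch gives 21 = (-3)·1071 + 7·462.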

2.1.5 (a) If p does not divide u, then integers P and Q exist such that Pp + Qu
= 1, or Ppv + Quv = v.

(b)  Suppose  there  are  two  prime  factorizations  of  the  integer  n.  If  p  is  one  of  the 
prime  factors  in  the  first  factorization,  it  must,  by  part  (a),  divide  one  of  the  factors 
in  the  second  factorization,  that  is,  coincide  with  one  of  the  factors. 


2.1.6 (c) For each b_i in part (b) may have a_i + 1 values.

(f) Since b divides ac, we have b_i ≤ a_i + c_i. Also, since a and b are relatively
prime, we have a_i = 0 or b_i = 0. In either case, b_i ≤ c_i.

(h) Suppose, employing the definition of a rational number, that √2 = a/b,
where a and b are positive integers. Then, since a^2 = 2b^2, we have (2a_1, 2a_2, ...) =
(1 + 2b_1, 2b_2, ...), whence 2a_1 = 1 + 2b_1, which is impossible.
2.1.7 Use the indirect method, and assume there are only a finite number of prime
numbers, a, b, ..., k. Set P = (a)(b) ... (k), and consider the number P + 1.
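
The idea behind this hint can be checked numerically. The Python sketch below, with the hypothetical names smallest_prime_factor and new_prime, forms P + 1 from a finite list of primes, confirms that each listed prime leaves remainder 1, and returns the smallest prime factor of P + 1, which therefore cannot be in the list.

```python
from math import prod

def smallest_prime_factor(n):
    """Return the smallest prime dividing n (n >= 2), by trial division."""
    d = 2
    while d * d <= n:
        if n % d == 0:
            return d
        d += 1
    return n

def new_prime(primes):
    """Given a finite list of primes, return a prime that is not in the list."""
    candidate = prod(primes) + 1
    # Every listed prime leaves remainder 1, so none of them divides candidate.
    assert all(candidate % p == 1 for p in primes)
    return smallest_prime_factor(candidate)
```

Note that P + 1 need not itself be prime: for the list 2, 3, 5, 7, 11, 13 one gets P + 1 = 30031 = 59 · 509, and the sketch returns 59.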

2.1.8 (a) Show that 2^{mn} − 1 contains the factor 2^m − 1.

(c) If a_1, a_2, ..., a_n represent all the divisors of N, then N/a_1, N/a_2, ..., N/a_n
also represent all the divisors of N.

2.3.3  (b)  Let  A  be  the  given  point  and  BC  the  given  line  segment.  Construct,  by 
Proposition  1,  an  equilateral  triangle  ABD.  Draw  circle  B(C),  and  let  DB,  produced 
if necessary, cut this circle in G. Now draw circle D(G) to cut DA, produced if necessary,
in L. Then AL is the sought segment.

2.4.3  Verification  of  the  first  four  postulates  presents  little  difficulty.  To  verify  the 
fifth  postulate  it  suffices  to  show  that  two  ordinarily  intersecting  straight  lines,  each 
determined  by  a  pair  of  restricted  points,  intersect  in  a  restricted  point.  This  may 
be  accomplished  by  showing  that  the  equation  of  a  straight  line  determined  by  two 
points  having  rational  coordinates  has  rational  coefficients,  and  that  two  such  lines, 
if  they  intersect,  must  intersect  in  a  point  having  rational  coordinates.  For  the  last 
part  of  the  problem,  consider  the  unit  circle  with  center  at  the  origin,  and  the  line 
through  the  origin  having  slope  one. 
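
The rationality argument in 2.4.3 can be illustrated with exact arithmetic. In the Python sketch below the names line_through and intersection are assumptions made for the illustration. Because only addition, subtraction, multiplication, and division of the coordinates are used, rational inputs (given as Fraction values) always produce rational outputs.

```python
from fractions import Fraction

def line_through(p, q):
    """Coefficients (a, b, c) of the line a*x + b*y = c through points p and q."""
    (x1, y1), (x2, y2) = p, q
    a, b = y2 - y1, x1 - x2
    return a, b, a * x1 + b * y1

def intersection(line1, line2):
    """Intersection point of two lines by Cramer's rule, or None if parallel."""
    a1, b1, c1 = line1
    a2, b2, c2 = line2
    det = a1 * b2 - a2 * b1
    if det == 0:
        return None
    return (c1 * b2 - c2 * b1) / det, (a1 * c2 - a2 * c1) / det

F = Fraction
diagonal = line_through((F(0), F(0)), (F(1), F(1)))      # the line y = x
antidiagonal = line_through((F(0), F(1)), (F(1), F(0)))  # the line x + y = 1
meeting_point = intersection(diagonal, antidiagonal)
```

By contrast, the line y = x meets the unit circle where 2x^2 = 1, that is, at x = ±√2/2, which is not rational; this is the point of the last part of the hint.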

2.4.5  (a)  Let  the  line  enter  the  triangle  through  vertex  A.  Take  any  point  U  on 
the  line  and  lying  inside  the  triangle.  Let  V  be  any  point  on  the  segment  AC,  and 
draw  line  UV.  By  Pasch’s  postulate,  UV  will  (1)  cut  AB,  or  (2)  cut  BC,  or  (3)  pass 
through  B.  If  UV  cuts  AB,  denote  the  point  of  intersection  by  W  and  draw  WC;  now 
apply  Pasch’s  postulate,  in  turn,  to  triangles  VWC  and  BWC.  If  UV  cuts  BC,  denote 
the  point  of  intersection  by  R;  now  apply  Pasch’s  postulate  to  triangle  VRC.  If  UV 
passes  through  B,  apply  Pasch’s  postulate  to  triangle  VBC. 

3.1.2  To  deduce  Euclid’s  fifth  postulate,  let  AB  and  CD  be  cut  by  the  transversal 
ST, and suppose ∠BST + ∠DTS < 180°. Through S draw QSR, making ∠RST
+ ∠DTS = 180°. Now apply, in turn, I 28, Playfair’s postulate, I 17.

3.1.4 (b) Draw PQ perpendicular to m, and on m mark off QQ_1 = PQ. Then
∠QPQ_1 = ∠QQ_1P = 180°/2^2. Now on m mark off, in the same direction as QQ_1,
Q_1Q_2 = PQ_1. Then ∠Q_1PQ_2 = ∠Q_1Q_2P = 180°/2^3. Repeat the construction
obtaining ∠Q_{n-2}PQ_{n-1} = ∠Q_{n-2}Q_{n-1}P = 180°/2^n. By the postulate of Archimedes,
there exists an integer k such that ka > 180°. Take n so large that 2^n > k. Then
180°/2^n < a. Etc.

(c) ∠QRP + ∠RPQ = ∠APR + ∠RPQ = 90°.

3.1.5  To  deduce  Playfair’s  postulate,  let  P  be  a  given  point  and  m  a  given  line  not 
passing through P. Draw PQ perpendicular to m, and then draw PA perpendicular
to PQ. By I 28, PA is parallel to m. Now use 3.1.4 (b), 3.1.4 (c), and Pasch’s postulate
to  show  that  PA  is  the  only  line  through  P  parallel  to  m. 

3.1.6  Try  to  construct  a  proof  along  the  following  lines.  Let  ABC  be  any  right 
triangle  and  draw  the  perpendicular  CD  to  the  hypotenuse.  Then  triangles  ABC  and 
ACD  have  two  angles  of  one  triangle  equal  to  two  angles  of  the  other  triangle, 
whence the third angles must also be equal. That is, ∠B = ∠ACD. Similarly,
∠A = ∠BCD. It now follows that ∠A + ∠B + ∠C = 180°. Now, if ABC is any
triangle,  divide  it  into  two  right  triangles  by  an  altitude. 

3.1.7  Try  the  same  experiment  and  reasoning  on  a  spherical  triangle,  using  a 
great  circle  arc  in  place  of  the  straightedge. 

3.2.1  (a)  We  are  given  (BD/DC)(CE/EA)(AF/FB)  =  1,  and  we  wish  to  prove 
that  AD,  BE,  CF  are  concurrent.  Suppose  the  contrary.  Let  BE  and  CF  intersect  in 
O, and let OA intersect BC in D'. Then, by Ceva’s theorem, (BD'/D'C)(CE/EA)
(AF/FB) = 1. Etc.

3.2.3  (c)  Drop  a  perpendicular  from  the  vertex  of  the  triangle  upon  the  line  joining 
the  midpoints  of  the  two  sides  of  the  triangle. 

3.3.1  See  Problem  3.2.3  (b). 

3.3.2  Consult  any  standard  text  on  elementary  solid  geometry. 

3.3.6  (b)  Let  ABC  be  a  triangle  having  the  sum  of  its  angles  equal  to  two  right 
angles.  If  ABC  is  not  already  right  isosceles,  draw  the  altitude  BD.  If  neither  of  the 
resulting  triangles  is  right  isosceles,  mark  off  on  the  longer  leg  of  one  of  them  a 
segment  equal  to  the  shorter  leg.  By  part  (a),  the  resulting  isosceles  right  triangle  has 
the sum of its angles equal to two right angles. By putting together two such congruent
isosceles right triangles, a quadrilateral can be formed having all its sides
equal and all its angles right angles. By putting together four such congruent quadrilaterals,
a larger quadrilateral of the same kind can be formed. By repeating the
last  construction  enough  times,  one  can  obtain  a  quadrilateral  of  the  same  kind 
having  its  sides  greater  in  length  than  any  given  segment.  A  diagonal  of  this  last 
quadrilateral  will  give  an  isosceles  right  triangle  of  the  type  desired. 

(c)  Let  ABC  be  any  right  triangle,  right-angled  at  C.  By  part  (b),  there  exists  an 
isosceles  right  triangle  DEF,  right-angled  at  E,  having  the  sum  of  its  angles  equal  to 
two  right  angles  and  its  legs  greater  than  either  of  the  legs  of  triangle  ABC.  Now 
produce CA and CB to A' and B', respectively, so that CA' = CB' = ED. Then
triangles  DEF  and  A'CB'  are  congruent.  Draw  A'B,  and  apply  part  (a)  to  triangle 
A'CB'.  The  extension  to  any  triangle  ABC  is  now  easily  accomplished  by  dividing 
ABC  into  two  right  triangles  by  one  of  its  altitudes. 

3.4.1  (b)  Divide  triangle  ABC  into  two  right  triangles,  and  apply  part  (a). 

3.5.4  (a)  Use  the  relations  of  Problem  3.5.3. 

(c)  This  is  an  immediate  consequence  of  parts  (a)  and  (b). 

(e) K = -(1/QP)(1/QT) = -1/(OF)^2 = -1/k^2.

3.5.5  This  follows  readily  from  the  definition  of  the  tractrix. 

3.5.8 (a) One might interpret “aba” as “committee” and “daba” as “committee
member,”  and  assume  that  there  are  just  two  committees  and  that  no  committee 
member  serves  on  more  than  one  committee. 

(b) One might interpret a “daba” as any one of the three letters a, b, c, and an
“aba” as any one of the three pairs, ab, bc, ca, of these letters.

4.1.4  (j)  Neither;  the  amounts  are  equal,  (k)  The  final  pile  will  be  over  17,000,000 
miles  high ! 

4.1.7 (a)-2 Let the three points be A, B, C, and let T be a motion transforming
A into A', B into B', C into C'. Designate the inverse of T by T^{-1}. Let R be any
effective motion which leaves A and B fixed. Now consider the resultant of the three
motions T^{-1}, R, T, made in this order.

(b)-1 Let the sphere have center A and pass through B, and let P be a point on
the sphere. Let T be a motion transforming A into A', B into B', P into P'. Designate
the inverse of T by T^{-1}. Let S be a motion which leaves A fixed but transforms B into
P. Now consider the resultant of the three motions T^{-1}, S, T, made in this order.

(c)  Pieri  says  AC  is  perpendicular  to  AB  if  there  exists  a  motion  which  leaves  A 
and  B  fixed  but  transforms  C  into  another  point  of  the  straight  line  CA. 

4.2.2  (d)  To  show  the  congruence  of  the  remaining  sides,  employ  reductio  ad 
absurdum. 
4.2.4 (a) Let h be a horn angle for which θ = 0, and let h' be any horn angle for
which θ > 0. Then, for every positive integer n, we have nh < h'.

(d) If a = a' and k = k', set M = M'; if a > a', set M > M'; if a = a' but
k > k', set M > M'. Define M' = nM if and only if a' = na, k' = nk.

(f)  See  part  (e). 

4.2.5  Huntington  made  the  following  definitions.  A  sphere  which  does  not  include 
any  other  sphere  is  called  a  point.  If  A  and  B  are  two  points,  the  segment  [ AB  ]  is  the 
class  of  all  points  X  such  that  every  sphere  which  includes  A  and  B  also  includes  X. 
The  extension  of  [AB]  beyond  A  is  the  class  of  all  points  X  such  that  [BX]  contains  A; 
similarly,  the  extension  of  [AB]  beyond  B  is  the  class  of  all  points  X  such  that  [AX] 
contains  B.  The  ray  AB  is  the  class  of  all  points  belonging  to  [AB]  or  to  the  extension 
of  [AB ]  beyond  B.  The  line  AB  is  the  class  of  all  points  belonging  to  [AB]  or  to  one 
of  its  extensions. 

4.3.2  (d)  See  the  verification,  in  Section  4.3,  of  Postulate  1-1. 

(f) Draw from O the perpendicular OP to the line. Let P' be the inverse of P,
and Q' the inverse of any other point Q on the line. Now apply part (e).

(g) Let C be the center of the circle. Draw OC, cutting the circle in Q and R,
and let P be any other point on the circle. If P', Q', R' are the inverses of P, Q, R,
show, by the aid of part (e), that ∠R'P'Q' = ∠QPR = 90°.

(h)  See  the  verification,  in  Section  4.3,  of  Postulate  1-1. 

0)  Take  p  equal  to  the  length  of  the  tangent  from  the  given  point  to  the  given 
circle. 

4.3.3 Consider two curves C_1, C_2 intersecting in point R, and let their inverses
C'_1, C'_2 intersect in point R'. Then O, R, R' are collinear, where O is the center of
inversion. Through O, and close to the ray ORR', draw a second ray cutting C_1, C_2,
C'_1, C'_2 in P, Q, P', Q', respectively. Using Problem 4.3.2, part (e), show that ∠PRQ
= ∠P'R'Q'. Now let the second ray rotate about O and approach ray ORR' as a
limiting position.

4.3.4 (a) Let P, Q, S, T be four points on a circle C and let P', Q', S', T' be their
inverses on the inverse curve C'. If O is the center of inversion, show that PT/P'T'
= OT/OP', P'S'/PS = OP'/OS, QT/Q'T' = OT/OQ', Q'S'/QS = OQ'/OS, and
therefore that (PQ, TS) = (P'Q', T'S').

4.3.5  Let  the  circles  along  the  sides  AC  and  BC  of  triangle  ABC  intersect  again 
in  R.  Take  R  as  the  center  of  inversion. 

4.3.6  (b)  Use  Problems  4.3.5  and  4.3.3. 

4.4.2 (b) We have, for any four real numbers a_1, a_2, b_1, b_2,

(a_1 + b_1)^2 + (a_2 + b_2)^2 = a_1^2 + a_2^2 + 2(a_1 b_1 + a_2 b_2) + b_1^2 + b_2^2
≤ a_1^2 + a_2^2 + 2 √(a_1^2 + a_2^2) √(b_1^2 + b_2^2) + b_1^2 + b_2^2,

or

√((a_1 + b_1)^2 + (a_2 + b_2)^2) ≤ √(a_1^2 + a_2^2) + √(b_1^2 + b_2^2).

Now set a_1 = x_1 − x_2, a_2 = y_1 − y_2, b_1 = x_2 − x_3, b_2 = y_2 − y_3.

4.4.4 Let (x_1,y_1) and (x_2,y_2) be the points A and B, (x_3,y_3) the point A' on the
line m, and a(x − x_3) + b(y − y_3) = 0 an equation of line m. The determination of
the points B' and B'' is the algebraic problem of finding the common solutions of the
equations

(x_2 − x_1)^2 + (y_2 − y_1)^2 = (x − x_3)^2 + (y − y_3)^2,
a(x − x_3) + b(y − y_3) = 0.

Show that these equations have two and only two solutions. Finally, show that
(x_3,y_3) is the midpoint [see Problem 4.4.1, part (e)] between points B' and B''.

4.4.8  Let  A,  B,  C  be  the  vertices  of  one  triangle,  and  denote  the  lengths  of  the  three 
opposite  sides  by  a,  b,  c.  Show  that  the  law  of  cosines, 

a^2 = b^2 + c^2 − 2bc cos A,

holds in the algebraic model. If we use primes for the second triangle, it now follows
that a^2 = a'^2. Then, by the law of cosines

b^2 = a^2 + c^2 − 2ac cos B,

applied to each triangle, it follows that cos B = cos B', or B = B'.

4.4.9 Let A and B be the points (x_1,y_1), (x_2,y_2). Then the coordinates of A_1 are
given by 4 of Section 4.4 by taking t = a/b, where a and b are the lengths of the
segments CD and AB, respectively. Now, in 4 of Section 4.4, replace t by 2t, 3t,
and so on, thus obtaining the coordinates of points A_2, A_3, and so on.

4.4.11  Use  Problem  4.3.5. 

4.5.1 (a) Let points 1 and 6 coincide, so that line 16 becomes the tangent to the
conic at point 1. (b) Use part (a). (c) Let 1, 2, 3, 4 be the four points and 45 the
tangent at 4 = 5, and let 12 cut 45 in P. Through 1 draw any line 16 cutting 34 in
R, and then draw the Pascal line PR to cut 23 in Q. Then 5Q cuts 16 in a point 6 on
the conic. (d) Take 1 = 6 and 3 = 4, and then take 2 = 3 and 5 = 6. (e) Take
1 = 2, 3 = 4, 5 = 6. (f) Use part (e).

4.5.2  (a)  Apply  the  law  of  sines  to  triangles  VAP  and  VPB,  remembering  that 
sin VPA = sin BPV. (b) Use part (a). (c) Use part (b).

4.5.3  (b)  Dualize  the  solution  of  4.5.1  (b).  (c)  Dualize  the  solution  of  4.5.1  (c). 
(f) Dualize the solution of 4.5.1 (f). (g) The theorem is its own dual.

4.5.4 (a) Choose for π' a plane parallel to the plane determined by point O and
line m. (b) Project line OU to infinity. (c) Project line LMN to infinity, and use the
elementary fact that the joins of corresponding vertices of two similar and similarly
situated triangles are concurrent. (d) Choose a plane π' parallel to the minor axis
of the ellipse and such that the angle θ between π' and the plane of the given ellipse
is arc cos b/a, where a and b are the semimajor and semiminor axes of the ellipse.
Now project the ellipse orthogonally onto π'.

4.5.5  Let  P  be  the  given  point.  Construct  the  polar  p  of  P  with  respect  to  the  ellipse, 
and let p cut the ellipse in points L and M. Then PL and PM are the required tangents.

5.1.2  (a)  No.  (b)  Yes. 

5.1.3  (e)  These  operations  are  commutative  with  respect  to  each  other. 

5.1.6 (a) Replacing a by z' in the first equality, and by z in the second equality,
we find z' ⊕ z = z' and z ⊕ z' = z. Therefore z' = z' ⊕ z = z ⊕ z' = z. (b) Add
each member of the equality a ⊕ b = a ⊕ c to ā, and then use the associative law
for ⊕. (c) Show that x = ā ⊕ b is a solution; then show, by part (b), that if there
are two solutions, x and y, we must have x = y. (d) By the distributive law, (a ⊗ a)
⊕ (a ⊗ z) = a ⊗ (a ⊕ z) = a ⊗ a = (a ⊗ a) ⊕ z. Therefore, by part (b), a ⊗ z
= z. (e) Consider the triple sum (a ⊗ b) ⊕ (a ⊗ b̄) ⊕ (ā ⊗ b̄). Combine the first
two summands, and use the distributive law; then combine the last two summands,
and use the distributive law. (f) Since a ⊗ b = z = a ⊗ z, we have, if a ≠ z, by
the cancellation law for the operation ⊗, b = z. (g) Show that x = a^{-1} ⊗ b is a
solution; then show, by the cancellation law for the operation ⊗, that if there are
two solutions, x and y, we must have x = y. (h) Show that (a ⊗ b) ⊕ (ā ⊗ b) = z.
5.1.7 (e) Since a ⧁ b, a ⊖ b is positive. Since b ⧁ c, b ⊖ c is positive. Therefore,
by Postulate P12, (a ⊖ b) ⊕ (b ⊖ c) is positive. But (a ⊖ b) ⊕ (b ⊖ c) = a ⊖ c.
Hence a ⊖ c is positive, and a ⧁ c. (g) Since a ⧁ b, a ⊖ b is positive. Since c is
given positive, (a ⊖ b) ⊗ c is positive, by Postulate P12. But (a ⊖ b) ⊗ c = (a ⊗ c)
⊖ (b ⊗ c), by part (h) of Problem 5.1.6. Therefore (a ⊗ c) ⧁ (b ⊗ c). (h) This
follows from parts (e) and (h) of Problem 5.1.6. (j) Use part (i).

5.2.1 (d) The zero element is (0,0); the unity element is (1,0); if x = (a,b), then
x̄ = (−a, −b) and x⁻¹ = [a/(a² + b²), −b/(a² + b²)].
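
The zero, unity, negative, and inverse named in 5.2.1 (d) can be checked mechanically. The following is only a sketch: it assumes the usual ordered-pair rules (a,b) + (c,d) = (a + c, b + d) and (a,b)(c,d) = (ac − bd, ad + bc), which are not restated in this answer, and the function names are illustrative.

```python
from fractions import Fraction

ZERO = (0, 0)   # zero element named in the answer
ONE = (1, 0)    # unity element named in the answer

def add(x, y):
    # Assumed rule: componentwise addition of ordered pairs.
    return (x[0] + y[0], x[1] + y[1])

def mul(x, y):
    # Assumed rule: (a,b)(c,d) = (ac - bd, ad + bc).
    a, b = x
    c, d = y
    return (a * c - b * d, a * d + b * c)

def neg(x):
    # Additive inverse (-a, -b), as stated in the answer.
    return (-x[0], -x[1])

def inv(x):
    # Multiplicative inverse [a/(a^2 + b^2), -b/(a^2 + b^2)], as stated.
    a, b = x
    n = Fraction(a * a + b * b)
    return (a / n, -b / n)

x = (Fraction(3), Fraction(-4))
print(add(x, neg(x)) == ZERO, mul(x, inv(x)) == ONE, mul(x, ONE) == x)
```

Exact rational arithmetic is used so that the equalities are tested without rounding error.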


5.2.9 Define

a(x) = 0 for 0 ≤ x ≤ 1/2,   a(x) = … − 1/2 for 1/2 < x ≤ 1,

b(x) = 0 for 0 ≤ x ≤ 1/2,   b(x) = x − 1/2 for 1/2 < x ≤ 1,

c(x) = −x + 1/2 for 0 ≤ x ≤ 1/2,   c(x) = 0 for 1/2 < x ≤ 1.

Then, although a(x)c(x) = b(x)c(x) and c(x) ≠ 0, we do not have a(x) = b(x).
Thus Postulate P9 fails to hold.


5.2.12 Show that [a b; c d]² = [0 1; 0 0] (matrix rows separated by semicolons)
implies: (1) b(a + d) = 1, (2) c(a + d) = 0, (3) a² + bc = 0, (4) cb + d² = 0.
From (1) it follows that a + d ≠ 0. Therefore, from (2), c = 0. Hence, from (3)
and (4), a = d = 0. This contradicts the conclusion that a + d ≠ 0.


5.2.18 (a) * is neither commutative nor associative; | is both commutative and
associative; the distributive law holds. (b) None of the laws holds. (c) Only the
two commutative laws hold. (d) | is associative and the distributive law holds.

5.2.20  (c)  This  may  be  shown  in  several  ways,  but  each  is  tricky.  Look  up  the 
proof  as  given  in  some  textbook  on  vector  analysis. 

5.3.2 (a) Yes. (b) No. (c) No. (d) No; G1 does not hold. (e) Yes.

5.3.5  No. 

5.3.8  (b)  Construct  the  operation  table  for  the  symmetric  group  of  degree  3. 

5.3.10 By G2' we are guaranteed the existence of an element i such that, for a
given element b, b * i = b. Now let a be any element of G. By G2' there exists an
element c such that a = c * b. Then

a * i = (c * b) * i = c * (b * i) = c * b = a,

and G2 is established. Finally, by G2', there exists for each element a of G an element
a⁻¹ of G such that a * a⁻¹ = i, and G3 is established.

5.3.11 No. Let G be the set of real linear functions of the form a = a₁x + a₂, a₁ ≠ 0,
and let a * b mean a(db/dx). Then x is a right identity element and x/a₁ is a left
inverse of a.
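
The counterexample in 5.3.11 can be tried out on concrete functions. In this sketch a linear function a₁x + a₂ is encoded as the pair (a1, a2), so that a * b = a(db/dx) becomes (a1·b1, a2·b1); the encoding and the helper names are assumptions made only for illustration.

```python
from fractions import Fraction

def star(a, b):
    # a * b = a * (db/dx): the derivative of b1*x + b2 is the constant b1.
    a1, a2 = a
    b1, _ = b
    return (a1 * b1, a2 * b1)

X = (Fraction(1), Fraction(0))          # the function x

def left_inverse(a):
    # The function x / a1, which the answer names as a left inverse of a.
    return (Fraction(1) / a[0], Fraction(0))

a = (Fraction(2), Fraction(5))          # the function 2x + 5
print(star(a, X) == a)                  # x acts as a right identity
print(star(left_inverse(a), a) == X)    # (x/a1) * a gives x
print(star(X, a) == a)                  # x is not a left identity here
```

The last line prints False for this choice of a, since x * a = 2x, which differs from 2x + 5.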


5.3.12 (c) In x⁻¹ * (x * y) = y, set y = e. (e) In x⁻¹ * (x * y) = y, set y = x⁻¹.
(f) In (x⁻¹)⁻¹ * (x⁻¹ * y) = y, set y = x. (h) Star both sides of x * y = z by x⁻¹
on the left and use the left inverse property to simplify. Then star both sides of the
result by z⁻¹ on the right and simplify, finally on the left by y⁻¹, again simplifying.

5.4.3 (S⁻¹T⁻¹)(TS) = S⁻¹(T⁻¹T)S = S⁻¹S = I.

5.4.4 (b) Rotation of 180° about the origin, reflection in a line, inversion in a
fixed  circle. 


5.4.5 (a) R'S' = (TRT⁻¹)(TST⁻¹) = TR(T⁻¹T)ST⁻¹ = T(RS)T⁻¹.

(b) (TST⁻¹)⁻¹ = TS⁻¹T⁻¹, by Problem 5.4.3.

(c) TST⁻¹ = (TS)T⁻¹ = (ST)T⁻¹ = S(TT⁻¹) = S.

5.4.9  (b)  It  is  the  tangent  of  the  angle  from  the  first  line  to  the  second  line. 

5.4.10 (a) Show, for example, that RH = D' and HR = D.

(b) I, R', H, V, D, D' are self-inverse; R and R'' are inverses of each other.

5.5.5  See  R.  A.  Rosenbaum. 

5.5.12  See  F.  Harary. 


6.3.3  See  Example  (j)  of  Section  5.1. 

6.3.8  Interpret  the  elements  of  S  as  a  set  of  all  rectangular  Cartesian  frames  of 
reference  which  are  parallel  to  one  another  but  with  no  axis  of  one  frame  coincident 
with  an  axis  of  another  frame,  and  let  b  F  a  mean  that  the  origin  of  frame  b  is  in  the 
first  quadrant  of  frame  a.  Or,  interpret  the  elements  of  S  as  the  set  of  all  ordered 
pairs  of  real  numbers  ( m,n ),  and  let  (m,n)  F  (u,v)  mean  m  >  u  and  n  >  v. 


6.3.9  (a)  Interpret  K  as  the  set  of  all  points  of  a  circumference  of  a  circle  and  let 
R(abc) mean “points a, b, c lie in clockwise order.”

6.3.10  Interpret  the  bees  as  six  people,  A,  B ,  C,  D,  E,  F,  and  the  four  hives  as  the 
four  committees  (A,  B,  C),  (A,  D,  E ),  (B,  F,  E ),  and  (C,  F,  D ).  Or,  interpret  the 
bees  and  the  hives  as  six  trees  and  four  rows  of  trees,  respectively,  forming  the  vertices 
and  sides  of  a  complete  quadrilateral. 


6.4.1  Interpret  S  as  the  set  of  all  points  on  a  horizontal  straight  line,  and  let  F  mean 
“is  to  the  right  of.” 

6.4.2  (a)  To  show  independence  of  P2,  interpret  the  bees  and  the  hives  as  four 
trees  and  four  rows  of  trees  forming  the  vertices  and  sides  of  a  square.  To  show 
independence  of  P3,  interpret  the  bees  as  four  trees  located  at  the  vertices  and  the 
foot  of  an  altitude  of  an  equilateral  triangle,  and  the  hives  as  the  four  rows  of  trees 
along  the  sides  and  the  altitude  of  the  triangle.  To  show  independence  of  P4,  interpret 
the  bees  and  the  hives  as  three  trees  and  three  rows  of  trees  forming  the  vertices  and 
sides of a triangle. (b) Taking into account the theorems of Problem 6.3.11, let us
designate the four hives by I, II, III, IV and the three bees in hive I by A, B, C. Let
hive II have bee A, and only bee A, in common with hive I. Then we may designate
the bees in hive II by A, D, E. Now let hive III be that hive which has bee B in
common with hive I. Then we may designate the bees of hive III by either B, D, F or
B, E, F. In the first case, hive IV must contain bees C, E, F, and, in the second case,
hive IV must contain bees C, D, F. There are, then, the following two ways of
designating the hives and the bees:


I           II          III         IV
A, B, C     A, D, E     B, D, F     C, E, F

or

I           II          III         IV
A, B, C     A, D, E     B, E, F     C, D, F


But  the  second  designation  can  be  changed  into  the  first  by  interchanging  the  two 
labels  D  and  E.  It  follows,  then,  that  any  interpretation  of  the  postulate  set  can  be 
labeled  as  in  the  first  designation,  and  the  desired  isomorphism  is  established. 
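
The relabeling step at the end of 6.4.2 (b) is easy to confirm by machine. The sketch below stores each designation as a list of sets of bee labels (an encoding chosen here only for illustration) and applies the interchange of D and E.

```python
from itertools import combinations

first = [{"A", "B", "C"}, {"A", "D", "E"}, {"B", "D", "F"}, {"C", "E", "F"}]
second = [{"A", "B", "C"}, {"A", "D", "E"}, {"B", "E", "F"}, {"C", "D", "F"}]

def relabel(hives, mapping):
    # Rename bees according to `mapping`; unlisted bees keep their labels.
    return [{mapping.get(bee, bee) for bee in hive} for hive in hives]

swap_d_e = {"D": "E", "E": "D"}
print(relabel(second, swap_d_e) == first)

# In the first designation any two hives share exactly one bee.
print(all(len(h & k) == 1 for h, k in combinations(first, 2)))
```

Both lines print True: hives I through IV of the second designation become, in order, hives I through IV of the first.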


6.4.3 See W. T. Guy, Jr.

6.4.4 Show that there is essentially only one way of labelling the four elements of K.

6.4.9 (e) For a solution, and an interesting discussion by O. Veblen, see Problem
2894,  American  Mathematical  Monthly,  29  (1922),  357-358. 

6.4.10 For a solution see Problem E 1152, American Mathematical Monthly, 62 (1955),
582-584.


6.4.11  For  a  solution  see  Problem  E  1098,  American  Mathematical  Monthly,  61 
(1954),  474. 

6.5.4  See  Section  9.3. 


6.5.7 (b) In one interpretation of T let S be the set of all positive integers, let
“a R b” mean “a is a proper divisor of b,” and let “a D b” also mean “a is a proper
divisor of b.” In the second interpretation let S be the set of all positive integers, let
“a R b” mean “a is a proper divisor of b,” and let “a D b” mean “b is a proper divisor
of a.”


6.5.8  See  F.  Harary. 


7.1.5  —  -  not  — . 

3  3 

7.1.9  (a)  The  integral  is  improper,  since  the  integrand  is  discontinuous  at  x  =  0. 
(b)  Examine  for  end-point  maxima  and  minima. 

7.2.4 Consider the collection N of all numbers of the form −a, where a is in M.

7.2.8  l.u.b.  =  2,  g.l.b.  =  -2;  l.u.b.  =  3/2,  g.l.b.  =  -1. 

7.2.9  No. 

7.2.11 If r > 0, then, by the Archimedean law, there exists an integer m' such
that m'(1) > r. Let M be the set of all integers m' such that m' > r. By the postulate
of continuity, this set contains a smallest member m. If r = 0, then m = 1. If r < 0,
show that m = −n + 1, where n is the smallest integer such that n > −r.

7.2.12 (a) Show that c > (c + d)/2 > d. (b) By hypothesis c − d > 0. Then, by the
Archimedean law, there is a positive integer n such that n(c − d) > 1, or 1/n <
c − d. Let m be the smallest integer such that m > nd (see Problem 7.2.11). Then
m − 1 ≤ nd, or (m − 1)/n ≤ d. That is, d < m/n = (m − 1)/n + 1/n < d + (c − d)
= c.
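
The construction in 7.2.12 (b) is effectively an algorithm. The sketch below follows it step by step for exact inputs; the function name and the use of Python's `Fraction` type for d and c are choices made here for illustration.

```python
import math
from fractions import Fraction

def rational_between(d, c):
    """Return m/n with d < m/n < c, following the steps of 7.2.12 (b)."""
    if not d < c:
        raise ValueError("need d < c")
    # A positive integer n with n(c - d) > 1, that is, 1/n < c - d.
    n = math.floor(1 / (c - d)) + 1
    # The smallest integer m with m > nd.
    m = math.floor(n * d) + 1
    return Fraction(m, n)

d, c = Fraction(1, 3), Fraction(1, 2)
q = rational_between(d, c)
print(q, d < q < c)
```

For d = 1/3 and c = 1/2 the steps give n = 7 and m = 3, so the rational produced is 3/7.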


7.3.3 (e) Let f(n) = 3^(2n+2) − 8n − 9. Show that f(1) = 64 and that f(k + 1)
− f(k) = 8(9^(k+1) − 1) = 64(9^k + 9^(k−1) + ··· + 1). (f) If f(n) = 9^n − 8n − 1,
show that f(k + 1) = 9f(k) + 64k. (h) a^(k+1) − b^(k+1) = a(a^k − b^k) + b^k(a − b).
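
The identities quoted in 7.3.3 drive the induction steps, and they can be spot-checked numerically before one writes out the proofs. This is a sketch only: a finite check is not a proof, and the function names are illustrative.

```python
def f_e(n):
    # f(n) = 3^(2n+2) - 8n - 9 from part (e)
    return 3 ** (2 * n + 2) - 8 * n - 9

def f_f(n):
    # f(n) = 9^n - 8n - 1 from part (f)
    return 9 ** n - 8 * n - 1

def identities_hold(limit):
    for k in range(1, limit):
        step = f_e(k + 1) - f_e(k)
        if step != 8 * (9 ** (k + 1) - 1):
            return False
        if step != 64 * sum(9 ** j for j in range(k + 1)):
            return False
        if f_f(k + 1) != 9 * f_f(k) + 64 * k:
            return False
        # Part (h) with sample integers a and b.
        a, b = 7, 3
        if a ** (k + 1) - b ** (k + 1) != a * (a ** k - b ** k) + b ** k * (a - b):
            return False
    return True

print(f_e(1), identities_hold(40))
```

Since f(1) = 64 in part (e), f(1) = 0 in part (f), and each step adds a multiple of 64, every value checked is divisible by 64.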

7.3.7 Show that u₁ = u₂ = 1 and that u_(k+2) = u_k + u_(k+1). Now employ the third
principle of mathematical induction, taking h = 2. The sequence of numbers u₁, u₂,
··· is known as the Fibonacci sequence.

7.3.9 The passage from P(k) to P(k + 1) assumes k ≥ 2.

7.3.10 If max (a,b) = k + 1, it does not necessarily follow that α and β are both
natural numbers.

7.3.12 Let M be a set of natural numbers containing 1 and containing k + 1
whenever it contains k. Suppose M does not contain all the natural numbers, and
let T be the set of all natural numbers not in M. By the principle of the least natural
number, there is a smallest natural number t in T. Since 1 is in M, t > 1 and t − 1
exists. Since t is the smallest natural number not in M, t − 1 is in M. Hence, by
hypothesis, t is also in M. But, since t is in T, this is impossible.

7.3.14 (a) See J. E. Freund.

7.4.2 (a) We are given (a + c, b + d) = (a + e, b + f), whence a + c + b + f
= b + d + a + e and c + f = d + e, which gives (c,d) = (e,f). (b) Suppose (x,y)
is a positive integer and let x = y + z. Then (y + z, y)(a,b) = (y + z, y)(c,d). Perform
the multiplication, use the definition of equality, and finally employ the cancellation
laws for natural numbers to obtain b + c = a + d. The proof when (x,y) is a negative
integer is similar. (c) Show that (c + b, d + a) is a solution. Now let (x',y') be any
solution. Then (a,b) + (c + b, d + a) = (a,b) + (x',y'), whence, by part (a), (x',y')
= (c + b, d + a).

7.4.3 (a) Let y = (a,b). Then −y = (b,a) and −(−y) = (a,b) = y. (b) Let x =
(a,b), y = (c,d). Then −x = (b,a), −y = (d,c), and (−x)(−y) = (b,a)(d,c) =
(bd + ac, bc + ad) = (a,b)(c,d) = xy. (c) Let x = (a,b), y = (c,d). Then −x = (b,a)
and (−x)y = (b,a)(c,d) = (bc + ad, ac + bd) = −[(a,b)(c,d)] = −(xy), etc. (d) Let
x = (a,b), y = (c,d). Then −(x + y) = −[(a,b) + (c,d)] = −(a + c, b + d) =
(b + d, a + c) = (b,a) + (d,c) = (−x) + (−y) = −x − y.
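
The sign rules of 7.4.3 can be exercised on small cases. The sketch below models an integer as an ordered pair (a,b) of natural numbers, with the product (ac + bd, ad + bc) and the equality test a + d = b + c read off from the computations in 7.4.2 and 7.4.3; the function names are illustrative assumptions.

```python
from itertools import product

def add(x, y):
    return (x[0] + y[0], x[1] + y[1])

def mul(x, y):
    a, b = x
    c, d = y
    return (a * c + b * d, a * d + b * c)

def neg(x):
    # -(a,b) = (b,a)
    return (x[1], x[0])

def eq(x, y):
    # (a,b) = (c,d) exactly when a + d = b + c
    return x[0] + y[1] == x[1] + y[0]

pairs = list(product(range(1, 5), repeat=2))
ok = all(
    eq(neg(neg(y)), y)                           # part (a)
    and eq(mul(neg(x), neg(y)), mul(x, y))       # part (b)
    and eq(mul(neg(x), y), neg(mul(x, y)))       # part (c)
    and eq(neg(add(x, y)), add(neg(x), neg(y)))  # part (d)
    for x in pairs for y in pairs
)
print(ok)
```

A finite search of this kind only illustrates the rules; the general statements still rest on the algebra shown above.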

7.4.5 (b) In proving that addition is well defined, make use of the fact that (ma,mb)
= (a,b) if m ≠ 0.

7.5.1 (c) Choose A' to be the set of all rational numbers of the form −x, where
x is any element of B except the possible smallest element. Let B' be all the remaining
rational numbers.

7.5.6 (b) [x,y] = [(ac + bd)/(c² + d²), (bc − ad)/(c² + d²)].

7.5.7  (b)  Use  part  (a)  and  mathematical  induction. 

8.2.2 (a) We have

(a ∪ b) ∩ (b ∪ c) = [a ∩ (b ∪ c)] ∪ [b ∩ (b ∪ c)]
                  = (a ∩ b) ∪ (a ∩ c) ∪ b.

Therefore

(a ∪ b) ∩ (b ∪ c) ∩ (c ∪ a)
    = [(a ∩ b) ∪ (a ∩ c) ∪ b] ∩ (c ∪ a)
    = [(a ∩ b) ∩ (c ∪ a)] ∪ [(a ∩ c) ∩ (c ∪ a)] ∪ [b ∩ (c ∪ a)]
    = [(a ∩ b ∩ c) ∪ (a ∩ b ∩ a)] ∪ [(a ∩ c ∩ c) ∪ (a ∩ c ∩ a)]
          ∪ [(b ∩ c) ∪ (b ∩ a)]
    = [(a ∩ b ∩ c) ∪ (a ∩ b)] ∪ [(a ∩ c)] ∪ [(b ∩ c) ∪ (b ∩ a)]
    = (a ∩ b) ∪ (a ∩ c) ∪ (b ∩ c).
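
The identity derived in 8.2.2 (a) can be confirmed in one particular Boolean algebra, the algebra of all subsets of a small set. The sketch below assumes a three-element universe and checks every triple of subsets; it illustrates the identity but does not replace the postulational proof.

```python
from itertools import combinations

def subsets(universe):
    items = list(universe)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]

def identity_holds(a, b, c):
    lhs = (a | b) & (b | c) & (c | a)
    rhs = (a & b) | (a & c) | (b & c)
    return lhs == rhs

S = subsets({1, 2, 3})
print(len(S), all(identity_holds(a, b, c) for a in S for b in S for c in S))
```

Here `|` and `&` are Python's set union and intersection, standing in for ∪ and ∩.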

8.2.3 Suppose x = z. Then

(x ∩ t') ∪ (x' ∩ t) = (z ∩ t') ∪ (u ∩ t) = z ∪ t = t.

Now suppose t = (x ∩ t') ∪ (x' ∩ t). Then

x ∩ t = x ∩ [(x ∩ t') ∪ (x' ∩ t)] = [x ∩ (x ∩ t')] ∪ [x ∩ (x' ∩ t)]
      = [(x ∩ x) ∩ t'] ∪ [(x ∩ x') ∩ t] = (x ∩ t') ∪ (z ∩ t)
      = (x ∩ t') ∪ z = x ∩ t'.

Therefore

x = x ∩ u = x ∩ (t ∪ t') = (x ∩ t) ∪ (x ∩ t') = (x ∩ t) ∪ (x ∩ t)
  = x ∩ t = x ∩ (t ∩ t) = (x ∩ t) ∩ t = (x ∩ t') ∩ t
  = x ∩ z = z.

8.2.9  (d)  =  (a£)a  +  ^  ^ 

8.2.10 See R. L. Goodstein [2].

8.3.1 Let M₁ be the midpoint of AB, M₂ the midpoint of M₁B, M₃ the midpoint
of M₂B, etc. Denote by E the set of all points on [AB] with the exception of points
A, B, M₁, M₂, M₃, ···. Then we have

[AB] = E, A, B, M₁, M₂, M₃, ···,

(AB] = E, B, M₁, M₂, M₃, ···,

[AB) = E, A, M₁, M₂, M₃, ···,

(AB) = E, M₁, M₂, M₃, ···.

It  is  now  apparent  how  we  may  put  the  points  of  any  one  of  the  four  segments  in 
one-to-one  correspondence  with  the  points  of  any  other  one  of  the  four  segments. 
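
One of the correspondences suggested by 8.3.1 can be written out explicitly. The sketch below assumes the segment is identified with the interval from 0 to 1, so that A = 0, B = 1, and M_k = 1 − 1/2^k; it sends A to M₁, each M_k to M_(k+1), and leaves B and the points of E fixed, which maps [AB] onto (AB]. The function names are illustrative.

```python
from fractions import Fraction

def is_power_of_two(n):
    return n >= 1 and n & (n - 1) == 0

def closed_to_half_open(x):
    """Map the points of [AB] one-to-one onto the points of (AB]."""
    x = Fraction(x)
    gap = 1 - x                     # distance from x to B
    # A and the midpoints M_k are exactly the points with gap 1/2^k, k >= 0.
    if gap.numerator == 1 and is_power_of_two(gap.denominator):
        return 1 - gap / 2          # A -> M1, M_k -> M_(k+1)
    return x                        # B and the points of E stay put

for x in (Fraction(0), Fraction(1, 2), Fraction(3, 4), Fraction(1, 3), Fraction(1)):
    print(x, "->", closed_to_half_open(x))
```

Treating A as "M₀" lets a single rule handle both A and the midpoints; the other pairs of segments can be matched by similar shifts.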

8.3.6 Let d denote the cardinal number of the set of all natural numbers. Then
the one-to-one correspondence

    1   2   3   4   ···   n       ···
    2   3   4   5   ···   n + 1   ···

shows that d = d + 1. This implies that d + 1 = (d + 1) + 1 = d + 2, although
1 ≠ 2.

Again, the one-to-one correspondences

    1   2   3   4   ···   n        ···
    2   4   6   8   ···   2n       ···
    1   3   5   7   ···   2n − 1   ···

show that d + d = d. Hence, by Theorem 3 of Section 8.3, (1 + 1)d = 1d, or 2d =
1d, although 2 ≠ 1.

Essentially  these  results  have  been  stated  in  certain  elementary  books  in  the 
undesirable  form:  Infinity  is  unchanged  by  adding  on  or  multiplying  by  a  finite 
number. 

8.4.4 (b) Consider the polynomial x² − 2. (c) Algebraic, since it is a zero of the
polynomial x² + 1. (g) Multiply the polynomial through by the lowest common
denominator of the coefficients.

8.4.5  (b)  Use  the  idea  employed  in  Theorem  1  of  Section  8.4. 

8.4.7  (a)  Use  an  indirect  argument,  along  with  Problem  8.4.5  (a). 

8.4.8  (a)  See  Problem  E  832,  American  Mathematical  Monthly,  56  (1949),  407. 

(c)  No,  for  there  are  c  points  on  a  straight  line  or  a  circle,  and  there  are  only  d 
rational  numbers  and  d  algebraic  numbers. 

8.4.10  It  is  a  matter  of  existence;  there  exists  a  greatest  triangle  inscribed  in  a 
circle,  but  there  does  not  exist  a  greatest  natural  number.  To  complete  argument  I, 
we  must  also  prove  that  a  maximum  triangle  inscribed  in  a  circle  exists.  The  problem 
illustrates  the  importance  in  mathematics  of  existence  theorems. 

8.4.12  (a)  In  each  interval  choose  a  point  with  rational  coordinate.  These  points 
are  all  distinct,  and  therefore  in  one-to-one  correspondence  with  the  intervals,  and 
they  constitute  an  infinite  subset  of  the  denumerable  set  of  all  rational  numbers. 

8.4.13 With the finite sequence {n₁, n₂, ···, n_r} associate the natural number

n = 2^(n₁) 3^(n₂) ··· p_r^(n_r),

where p_r is the rth prime number. Since factorization of a natural number into
powers of primes is unique, there is a one-to-one correspondence between the set of
all finite sequences of non-negative integers and the set of all natural numbers.
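
The prime-power coding of 8.4.13 can be sketched in a few lines. The helper names below are illustrative, and the round trip shown assumes a sequence whose last entry is nonzero, since the decoder reads exponents only up to the largest prime that actually divides n.

```python
def primes():
    # Yield 2, 3, 5, ... by trial division against the primes found so far.
    found = []
    n = 2
    while True:
        if all(n % p for p in found):
            found.append(n)
            yield n
        n += 1

def encode(seq):
    # n = 2^(n1) * 3^(n2) * ... * p_r^(n_r)
    n = 1
    for p, e in zip(primes(), seq):
        n *= p ** e
    return n

def decode(n):
    # Read the exponents back from the prime factorization of n.
    exps = []
    for p in primes():
        if n == 1:
            break
        e = 0
        while n % p == 0:
            n //= p
            e += 1
        exps.append(e)
    return exps

code = encode([3, 0, 2])
print(code, decode(code))
```

The sequence 3, 0, 2 is coded as 2³ · 3⁰ · 5² = 200, and decoding 200 returns the same three exponents.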

8.4.14 Assume S is denumerable. Then we can arrange the members of S in a
sequence {f₁(x), f₂(x), ···, f_n(x), ···}. Consider the array

f₁(1)  f₁(2)  f₁(3)  ···

f₂(1)  f₂(2)  f₂(3)  ···

f₃(1)  f₃(2)  f₃(3)  ···

···

Now form the function f(x) such that f(n) = f_n(n) + 1. Then f(x) belongs to S.
But f(x) cannot be in the given denumeration, for it differs from f₁(x) in the value
taken by x = 1, from f₂(x) by the value taken by x = 2, and so on.
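
The diagonal construction in 8.4.14 can be imitated on a finite list. The four listed functions below are arbitrary stand-ins for f₁, f₂, f₃, f₄, chosen only for illustration; the point is that f(n) = f_n(n) + 1 disagrees with the nth listed function at the argument n.

```python
def diagonal(functions):
    # Build f with f(n) = f_n(n) + 1, using 1-based indexing as in the text.
    def f(n):
        return functions[n - 1](n) + 1
    return f

listed = [
    lambda x: x,
    lambda x: x * x,
    lambda x: 7,
    lambda x: 2 * x + 1,
]

f = diagonal(listed)
print([f(n) for n in range(1, 5)])
print(all(f(n) != listed[n - 1](n) for n in range(1, 5)))
```

A finite list cannot exhibit denumerability itself; the sketch shows only the step that makes f differ from each listed function.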

8.4.15 Let K be the set of all natural numbers, and let a R b mean a < b. Let K'
be the set of all natural numbers, but define a R b as follows: if a is odd and b is even,
we write a R b; if a and b are both odd or both even, then a R b means a < b. Each
of these systems satisfies Postulates P1, P2, P3, P'4, but the two systems are not
isomorphic.

8.5.2 (a) S plus the y-axis. (b) S plus the circle x² + y² = 9. (c) The origin.
(d) S plus the segment [−1, 1] of the y-axis. (e) The x-axis.

8.5.3 For Example (3), a circle is a square with center at C and having its diagonals
equal to 2r in length and lying parallel to the coordinate axes.

8.5.5 (c) It is only the verification of the triangle inequality that presents any
difficulty. Denote ρ(y,z), ρ(z,x), ρ(x,y) by a, b, c, respectively. Then we have

b/(1 + b) = 1/(1/b + 1) ≤ 1/[1/(c + a) + 1] = (c + a)/(1 + c + a)
          = c/(1 + c + a) + a/(1 + c + a) ≤ c/(1 + c) + a/(1 + a).

(Notice that for this metric all distances are less than 1.)
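
The chain of inequalities in 8.5.5 (c) can be checked on a grid of exact values. The sketch assumes, as the computation suggests, that the new distance is ρ/(1 + ρ), and it tests every triple a, b, c from the grid that satisfies the original triangle inequality b ≤ c + a. The helper name `squash` is illustrative.

```python
from fractions import Fraction
from itertools import product

def squash(d):
    # Assumed form of the new distance: d / (1 + d).
    return d / (1 + d)

def chain_holds(a, b, c):
    s = c + a
    middle = c / (1 + s) + a / (1 + s)
    return (squash(b) <= squash(s)
            and squash(s) == middle
            and middle <= squash(c) + squash(a))

grid = [Fraction(k, 4) for k in range(13)]
triples = [(a, b, c) for a, b, c in product(grid, repeat=3) if b <= c + a]
print(all(chain_holds(a, b, c) for a, b, c in triples))
print(all(squash(d) < 1 for d in grid))
```

The second line reflects the closing remark that all distances in this metric are less than 1.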

9.1.2  (e)  There  will  be  8  lines  in  the  truth  table. 

9.1.5  Use  the  logical  equivalences  tabulated  at  the  end  of  Section  9.1. 

9.1.8  Use  the  results  of  Problem  9.1.7. 

9.1.10  See  Problem  9.1.9. 

9.1.11 (a) Show that (p ↔ q) ↔ [(p → q) ∧ (q → p)] is a tautology.
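
A truth-table check of the formula in 9.1.11 (a) is short enough to run by machine. The sketch below models propositions as Python booleans and tries every assignment; the helper names are illustrative.

```python
from itertools import product

def implies(p, q):
    return (not p) or q

def iff(p, q):
    return p == q

def is_tautology(formula, arity):
    # True when the formula holds on every line of its truth table.
    return all(formula(*values)
               for values in product([True, False], repeat=arity))

def formula(p, q):
    return iff(iff(p, q), implies(p, q) and implies(q, p))

print(is_tautology(formula, 2))
print(len(list(product([True, False], repeat=3))))
```

The second line counts the assignments for three propositions, which is the 8 lines mentioned in the answer to 9.1.2 (e).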

9.1.13 All propositions constructed by applying ∧, ∨, →, ↔ to the proposition
p are true propositions provided p is true; p', on the other hand, has the truth value
opposite to that of p. Therefore p' cannot be expressed without the employment of
negation.

9.2.1 (a) Substituting m for p in Postulate L1, we obtain (m ∨ m) → m. But we are
given m ∨ m. Hence, by Rule R3, we have m. (b) Substituting m for q in Postulate
L2 we obtain m → (p ∨ m). But we are given m. Hence, by Rule R3, we have p ∨ m.
(c) Substitute m for p and n for q in Postulate L3, etc. (d) Substitute m for q and n
for r in Postulate L4, etc. (e) Start with Theorem 1 and use Rule R3 twice.

9.2.3 By Theorem 3, p → p. Therefore, by Rule R2, p → q and q → p, and by Rule
R4, (p → q) ∨ (q → p), whence, by Definition 3, p ↔ q.

9.2.4 (a) Substitute p' for p in Theorem 6, and then use Rule R10 based on Theorem
11. Or, start with L1 and use Rule R10 based on Theorem 12. (b) By Theorem 4
and Rule R10 based on Theorem 11, p' ∨ (p')'. By Rule R10, again based on
Theorem 11, {[p' ∨ (p')']'}'. Therefore, by Definition 2, (p ∧ p')'.

9.2.5 (a) Substitute p' for p, and q' for q, in Postulate L3, and then use Definition 1.
(b) Replace q by q' in the tautology of part (a), etc. (c) By Theorem 16, (m → n)
→ (n' → m') and (n → m) → (m' → n'). But m → n and n → m. Therefore n' → m'
and m' → n', or m' ↔ n'.

9.2.6  (a)  Substitute  p'  for  p  in  Postulate  L2  and  use  Definition  1.  (b)  Substitute  p 

for  q  and  q  for  p  in  Postulates  L2  and  L3,  and  then  use  Rule  R9  [see  Problem  9.2.1 
(e)].  (c)  Substitute  p'  for  p  in  the  tautology  of  part  (b),  and  use  Definition  1. 

9.2.9  (a)  a'  \J  b,  (b)  (a  V  b)  »  b. 

9.2.11 Suppose two contradictory tautologies s and s' can be obtained. Replacing p
in Theorem 22 by s we obtain, by R1, the tautology s' → (s → q). Now applying R3
we find that s → q is a tautology. Applying R3 again, we find that q is a tautology.
But q represents any proposition, and therefore need not be a tautology. It thus
follows that the calculus of propositions cannot be inconsistent, that is, it is consistent.
The proof illustrates the direct method of establishing absolute consistency. Note that
the method has been applied to a system containing an infinite number of primitive
elements.

9.3.5  mm~2  ( m  —  l)m2+1. 

9.3.6  (a)  4.  (b)  2916. 

9.4.6 The complement of any member of Ad is a member of W, and conversely.

Abas and dabas, 87, 120
Abelian  group,  140 
Abelian  semigroup,  140 
Absolute  independence,  195 
Absorption  laws  of  Boolean  algebra,  245 
Abstract  spaces,  259 
Hausdorff  space,  259 
Hilbert  space,  261 
metric  space,  261 
taxi-cab  space,  261 
topological  space,  263 
Absurdity,  307 

Ackermann, W., collaboration with
Hilbert on formalist program, 304
Adams, H., on the nature of
mathematics (quoted), 190
Addition  of  points,  158 
Ahmes  papyrus,  4 
Aleph-null,  253n 
Algebraic  geometry,  111 
Algebraic  numbers,  definition  of, 

255,  322,  326 
denumerability  of,  255 
Algebraic  structure,  128-133 
Alice in Wonderland (Lewis Carroll), 24
Almagest  (Ptolemy),  60 


American  Mathematical  Monthly: 

Problem  E  832,  378 
Problem  E  1098,  376 
Problem  E  1152,  376 
Problem  2894,  376 
Analogy,  reasoning  by,  6 
Analysis  situs,  148 
Analytic  geometry,  105-111 
interpretation  of  Hilbert’s 
postulates,  107 
invention  of,  105-106 
a  method  of  geometry,  110 
Analytica  posteriora  (Aristotle),  18, 

34,  35,  39 

Angle  of  parallelism,  71 
Apiary,  192 

Apollonius,  Greek  mathematician  of 
antiquity,  48 

history  of  analytic  geometry,  105 
Applicable  surfaces,  85 
Applied  mathematics,  definition  of,  169 
examples  of,  172-173 
Arabs,  custodians  of  mathematics 
during  Dark  Ages,  49 
Archimedean law of real numbers, 206
Archimedes, application of the
axiomatic  method,  47-48 
mentioned  by  Proclus,  30-31 
On  Floating  Bodies ,  48 
On  Plane  Equilibriums ,  47,  57 
On  the  Sphere  and  Cylinder ,  48 
postulate  of,  48 

Aristotle, Analytica posteriora, 18,
34, 35, 39

on  the  axiomatic  method  (quoted), 
34,  35 

on  axioms  and  postulates,  35 
irrationality of √2, 17
laws  of  logic,  290 
Metaphysics ,  32 

petitio  principii  in  theory  of  parallels,  41 
teacher  of  Eudemus,  12 
Arithmetic  in  Nine  Sections ,  21 
Arithmetic  mean,  25 
Arithmetization  of  analysis,  202 
Aryabhata,  Ganita ,  21 
Associative  laws,  129 
of  Boolean  algebra,  245 
Ausdehnungslehre  (Grassmann),  137 
Autological  adjectives,  309 
Axiom,  challenging  an,  295 
of  choice,  328-331 
independence  of,  331 
statement  of,  329 
of  reducibility,  302 
systems  ( see  Postulate  systems) 
versus  postulate,  34-37 
Axiomatic,  or  postulational,  method, 
166-170 

Aristotle  on,  34,  35 
Euclid’s  conception  of,  41-42 
origin  of,  16-18 
Proclus  on,  36-37 
Axiomatics,  formal,  15,  169 
material,  14-16,  169 

Banach-Tarski  paradox,  331 
Beatley,  R.,  postulational  treatment  of 
high  school  geometry,  100 
Bees  and  hives,  192 


Begriffsschrift (Frege), 276
Bell,  E.  T.,  on  the  nature  of 

mathematics  (quoted),  77 
Beltrami,  E.,  consistency  of 

non-Euclidean  geometry,  74 
resurrection  of  Saccheri’s  work,  66 
surfaces  of  constant  negative 
curvature,  76 

Berkeley,  Bishop  G.,  criticism  of 
foundations  of  the  calculus, 
198-199 

Bernays,  P.,  collaboration  with  Hilbert 
on  formalist  program,  304 
on  postulates  for  calculus  of 
propositions,  288 
reconstruction  of  set  theory,  298 
Bernoulli,  Johann,  function  concept,  264 
infinitesimals  (quoted),  199 
Bible,  quotation  from,  24 
value of π, 2n

Birkhoff,  G.,  application  of  many-valued 
logics  to  quantum  theory,  292 
on  modern  algebra  (quoted),  137-138 
Birkhoff,  G.  D.,  postulational  treatment 
of  high  school  geometry,  100 
Bocvar,  D.  A.,  three-valued  logic,  294 
Bolyai,  J.,  absolute  science  of  space,  70 
challenging  an  axiom,  295 
discovery  of  non-Euclidean 
geometry,  70 

Bolyai,  W.,  father  of  Johann  Bolyai,  70 
Boole, G., Boolean algebra, 248
Investigation into the Laws of Thought, An, 275
Mathematical Analysis of Logic, The, 275
Boolean algebra, 117, 243-249
absorption laws, 245
associative laws, 245
definition of, 244
De Morgan’s laws, 246
idempotent laws, 245
postulates, 244
principle of duality, 245
Boolean ring, 268
Boole-Schroder algebra, 275
Borel, E., axiom of choice, 329
Boundlessness of straight lines, 45
Brianchon, C. J., French geometer, 112
theorem,  114 

Brouwer,  L.  E.  J.,  founder  of 
intuitionist  school,  302 
Burali-Forti,  C.,  paradox,  296 

Calculus  of  propositions,  281-290 
a  Boolean  algebra,  289 
definition  of,  282 
postulates  for,  283,  288 
principle  of  duality,  289 
rules  for  obtaining  theorems,  283-284 
theorems,  284-287 

Campanus,  J.,  translation  of  Euclid’s 
Elements ,  50 
Cancellation  laws,  132 
Cantor,  G.,  aphorism,  77,  295 
denumerability  of  algebraic 
numbers,  255 
denumerability  of  rational 
numbers,  254 
diagonal  process,  255 
existence  of  transcendental  numbers, 
256,  326 

genetical  development  of  real 
number  system,  208,  222 
on  the  nature  of  mathematics 
(quoted),  190 

nondenumerability  of  real  numbers, 
255 

originator  of  set  theory,  249 
paradox,  296-297 
transfinite  numbers,  252-256 
Cap,  241 

Cardinal  number,  of  continuum,  257 
definition  of,  251 

Carlyle, T., translation of Legendre’s
Elements de geometrie, 67n
Carnap,  R.,  contributor  to  logistic 
program,  301 

Carroll,  Lewis,  quotation  from 
Alice  in  Wonderland ,  24 
Cartesian  product,  251 
Categorical  proposition,  24 
Categoricalness  of  postulates,  181-182 
Catenary,  86 


Catoptrica  (Heron),  28 
Cauchy,  A.  L.,  development  of  a 
theory  of  limits,  202 
resolution  of  the  second  crisis  in  the 
foundations  of  mathematics,  296 
Cayley,  A.,  challenging  an  axiom,  295 
lines,  124 

matric  algebra,  138 
on  projective  geometry  (quoted),  147 
Center  of  inversion,  122 
Ceva’s  theorem,  80,  126 
Cevian,  82 

Chasles,  M.,  French  geometer,  112 
history  of  geometry,  111 
Church,  A.,  on  systems  of  logic 
(quoted),  290-291 
theorem,  336 

Chwistek,  L.,  contributor  to  logistic 
program,  301 
Cigar  game,  27 
Circle  of  inversion,  122 
Clairaut,  A.  C.,  function  notation,  264 
Cohen,  P.  J.,  independence  of 

continuum  hypothesis,  257 
Collapsing  compasses,  318 
Commandino,  F.,  translation  of  Euclid’s 
Elements ,  51 

Commentary  on  Euclid ,  Book  I  (Proclus), 

30,  36,  61 

Commutative  laws,  129 
Commutative  ring,  137,  144 
Complement  of  a  set,  241 
Complete  independence,  184 
Completeness  of  postulates,  180-183 
Complex  numbers,  classification  of,  228 
genetical  development  of,  225-226 
Hamilton’s  treatment  of,  134-135 
Conclusion,  7 

Condition,  necessary,  sufficient, 

necessary  and  sufficient,  307 
Conformal  transformation,  123 
Conjunction,  definition  and 
notation,  277 
truth  table,  279,  292-294 
Consistency,  absolute,  175 
direct test for, 177-178
of postulate sets, 175-178
relative,  177 

Constructible  set  of  numbers,  322 
Contingency,  307 
Continuity,  265 
postulate,  44 
Continuum,  260 
hypothesis,  257 

independence  of,  257 
Contrapositive  proposition,  307 
Contrapositive  statement,  24 
Converse  proposition,  307 
Converse  statement,  24 
Copernicus,  N.,  challenging  an 
axiom,  295 

Crises  in  the  foundations  of 
mathematics,  295-299 
Cross  ratio,  definition  of,  101 
a descriptive property, 113
group,  159 

invariant  under  projection,  125 
Cup,  241 

Curvature  of  surface,  85 
Cut,  Dedekind,  222 
Cyclic  order,  192 
Cycloidal  arch,  22 


D’Alembert,  J.  le  R.,  foundations  of 
analysis,  201 

Dantzig,  T.,  on  the  nature  of 

mathematics  (quoted),  190 
Dark  Ages,  49 

De arte combinatoria (Leibniz), 275
Decision  theory,  185,  305 
Dedekind,  R.,  contributor  to  logistic 
program,  300 
cut,  222 

definition  of  infinite  set,  253 
Eudoxian  theory  of  magnitude,  296 
genetical  development  of  real 
number  system,  208,  222 
postulate  of,  55,  98 
Stetigkeit und irrationale Zahlen, 222
theorem of continuity, 206
Was sind und was sollen die Zahlen?, 222


Deduction,  6-9 

explanations  of  Greek  preference 
for,  11 

Definition,  explicit,  91 
implicit,  91 
impredicative,  298 
by  induction,  212 
Dehn,  M.,  non-Archimedean 
geometry,  79 

postulate  of  Archimedes  and 

Legendre’s  two  theorems,  68 
De  Moivre’s  theorem,  238 
De  Morgan,  A.,  Formal  Logic ,  275 
foundations of algebra, 132
laws  of,  246 
Denumerable  set,  253 
Denumerability  of,  algebraic 
numbers,  255 

rational  numbers,  254,  338 
Dependent  variable,  264 
Derived  tautology,  283 
Desargues,  G.,  projective  geometry, 
111-112 

seventeenth-century  French 
geometer,  51 

two-triangle  theorem,  125 
Descartes,  R.,  invention  of  analytic 
geometry,  51,  106 
Descriptive  properties,  112 
De  Sua,  F.,  on  formal  systems 
(quoted),  305 
Die  Theorie  der  Parallellinien 
(Lambert),  66 

Diophantus,  Greek  mathematician  of 
antiquity,  48 

non-Hellenistic  quality  of  his 
mathematics,  10 

Dirichlet,  L.,  function  concept,  264 
Disjunction,  definition  and  notation,  277 
exclusive,  306 
truth  table,  279 
Distance  function,  261 
Distributive  law,  129 
Division  ring,  137,  144 
Domain  of  definition,  264 
Doubtful  propositions,  295 
Duplication of the cube, 323

Einstein, A., challenging an axiom, 295
theory  of  relativity,  78,  148,  259 
Elements  (Euclid),  14,  15,  30-47,  332 ff 
contents,  32-33 

definitions,  axioms,  postulates,  37-42 
first  28  propositions,  311-312 
formal  nature  of,  30-33 
Heiberg  text,  37 
logical  shortcomings  of,  42-47 
Theon’s  recension,  33 
translated  by  Campanus,  50 
translated  by  Commandino,  51 
translated  by  Robert  Simson,  51 
Elements  de  geometrie  (Legendre),  67 
translated by Thomas Carlyle, 67n
translated  by  John  Farrar,  67 n 
Elements  of  a  subject,  31-32 
Empirical  reasoning,  2-5 
Empty  set,  240 
Epimenides  paradox,  298 
Equivalence,  classes,  152 
logical,  277,  280 
of  postulate  sets,  174-175 
relation,  152 

absolutely  independent  postulate 
set  for,  195 
truth  table,  279 

Eratosthenes,  Greek  mathematician 
of  antiquity,  48 
mentioned  by  Proclus,  31 
Erlanger  Programm,  146,  258 
Eubulides  paradox,  298 
Euclid,  conception  of  the  axiomatic 
method,  41-42 

Elements  [ see  Elements  (Euclid)] 
fifth  postulate,  58-61 

attempts  to  prove,  60-61 
substitutions  for,  60 
mentioned  by  Proclus,  30-31 
parallel  postulate,  41 
royal  road  in  geometry,  110 
Euclidean  algorism,  52,  369 
Euclidean  compasses,  53 
Euclidean  constructions,  317-325 
Euclidean  geometry,  relative 
consistency  of,  110 
Euclidean  tools,  40 


Euclides  ab  omni  naeuo  vindicatus 
(Saccheri),  64 

Eudemian  Summary  (Proclus),  12-13,
16-17,  25-26,  32,  39 
Eudemus,  early  history  of  geometry,  12 
Eudoxus,  definition  of  proportion,  332 
Euclid’s  Book  V  and  Book  VI,  33 
postulate  of  Archimedes,  48 
resolution  of  first  crisis  in  the 

foundations  of  mathematics,  18, 
295-296,  332-334 
Euler,  L.,  diagrams,  7-9 

formalism  in  mathematics,  200 
function  concept  and  notation,  264 
Swiss  mathematician,  7 
Exclusive  disjunction,  306 
Explicit  definition,  91 

Farrar,  J.,  translation  of  Legendre’s
Elements  de  geometrie,  67 n
Fermat,  P.,  creator  of  modern  number 
theory,  51 

invention  of  analytic  geometry,  106 
last  theorem,  335 
Fibonacci,  sequence,  376 
Field,  144 

number,  133,  319 
operations,  133 
ordered  ( see  Ordered  field) 
postulates,  132 

redundancies,  313-314 
theorems,  154-155 
Finite  group,  140 
Finite  projective  geometry,  117 
Fitch,  G.  D.,  on  the  nature  of 

mathematics  (quoted),  189 
Fluxion,  199 

Forder,  H.  G.,  postulational  treatment 
of  Euclidean  geometry,  99 
Formal  axiomatics,  15,  166-188 
pattern  of,  168 

Formal  Logic  (De  Morgan),  275 
Formalism,  304-305 
Formulaire  de  mathematiques  (Peano),  276 
Foundations  of  Analysis  (Landau),  217 
Fourier,  J.  B.  J.,  function  concept,  264 
Fraenkel,  A.,  reconstruction  of  set
theory,  298 

Frechet,  M.,  abstract  spaces,  259 
Frege,  G.,  Begriffsschrift ,  276 

contributor  to  logistic  program,  300 
definition  of  cardinal  number  of  a 
set,  251 

Grundgesetze  der  Arithmetik ,  276 
postulates  for  calculus  of 
propositions,  288 
Russell  paradox,  297 
symbolic  logic,  275-276 
Function  concept,  continuity,  265 
dependent  variable,  264 
domain  of  definition,  264 
history  of,  264-265 
independent  variable,  264 
range  of  values,  264 
Fundamental  theorem,  of  algebra, 

227,  256 

of  arithmetic,  52 

Galileo  Galilei,  area  under  arch  of 
cycloid,  22 

challenging  an  axiom,  295 
founder  of  science  of  dynamics,  51 
infinite  sets,  253 
Ganita  (Aryabhata),  21 
Gauss,  C.  F.,  legend  about  sum  of  angles 
of  a  large  physical  triangle,  87 
non-Euclidean  geometry,  69-70 
rigor  in  analysis,  201-202 
Gaussian  integers,  130 
General  topology,  263 
Geodesics,  74 
Geometric  algebra,  111
Geometric  mean,  25 
Geometrical  paradoxes,  54-55 
Geometries,  146 ff

Geometrische  Untersuchungen  zur  Theorie  der
Parallellinien  (Lobachevsky),  71
Geometry,  Klein’s  definition  of,  146 
Lorentz,  162 
non-Archimedean,  79 
non-Desarguesian,  126 


non-Euclidean  (see  Non-Euclidean 
geometry) 

non-Riemannian,  79 

plane  equiform,  146 

plane  Euclidean  metric,  146 

plane  similarity,  146 

projective  (see  Projective  geometry) 

Riemannian,  79 

[See  also  Elements  (Euclid)  and  topics 
under  Euclidean] 

Gergonne,  J.  D.,  French  geometer,  112 
principle  of  duality,  114 
“proof”  of  parallel  postulate,  80 
Gherardo  of  Cremona,  translator  of 
Arabic  works,  50 
Gibbs,  J.  W.,  vector  analysis,  157 
Godel,  K.,  axiom  of  choice,  331 
consistency  of  continuum 
hypothesis,  257 
first  theorem,  335 
incompleteness  theorem,  334-338 
index,  338 

limitation  of  the  formalist  program, 
305,  336 
number,  337 
second  theorem,  336 
Goldbach  conjecture,  335 
Grassmann,  H.  G.,  Ausdehnungslehre ,  137 
liberation  of  algebra,  132 
Greek  mathematics,  nature  of,  10 
origin  of,  10 

Gregory,  D.  F.,  foundations  of 
algebra,  131 
Group(s),  140-144 
Abelian,  or  commutative,  140 
consistency  of  postulates,  142 
cross  ratio,  159
finite,  140 

fundamental  theorems,  143 
infinite,  140 

multiplication  table,  140 
non-Abelian,  140 
postulates,  140,  158 
significance  of,  144-148 
symmetric,  159 

symmetries  of  a  square,  162-163 
transformation,  146 
Grundgesetze  der  Arithmetik  (Frege),  276
Grundlagen  der  Geometrie  (Hilbert),
94-100,  304 

Grundlagen  der  Mathematik  (Hilbert  and 
Bernays),  276,  305 

Halmos,  P.  R.,  symbol  for  end  of  a 
proof,  168 

Halsted,  G.  B.,  postulational  treatment 
of  high  school  geometry,  100 
Hamilton,  W.  R.,  challenging  an 
axiom,  295 

liberation  of  algebra,  132,  134-137 
quaternions,  135-137 
treatment  of  complex  numbers, 
134-135 

Hankel,  H.,  foundations  of  algebra,  132 
on  the  nature  of  mathematics 
(quoted),  189 

Hardy,  G.  H.,  reductio  ad  absurdum 
as  a  gambit,  63 
Harmonic  mean,  25 
Harriot,  T.,  algebraic  notation,  51 
Hauff,  J.  K.  F.,  “proof”  of  parallel 
postulate,  80 
Hausdorff  space,  259 
Heath,  T.  L.,  Thales’  measurement,  25 
translation  of  Heiberg  text  of 
Euclid’s  Elements ,  37 
Hegel,  G.  W.  F.,  three-valued  logic, 

291  n 

Height  of  a  polynomial,  255 
Helmer,  O.,  on  comparison  of 

equivalent  postulate  sets,  174-175 
Hermite,  C.,  transcendentality  of  e ,  257 
Herodotus,  origin  of  geometry,  1 
Heron,  approximate  construction  of 
regular  heptagon,  28 
Catoptrica ,  28 
geometry,  18 

Greek  mathematician  of  antiquity,  48 
method  of  approximating  square 
roots,  27 
Metrica,  27-28

non-Hellenistic  quality  of  his 
mathematics,  10 


Heterological  adjectives,  309 
Heyting,  A.,  intuitionist  symbolic 
logic,  292,  303 

Hilbert,  D.,  founder  of  formalist 
school,  304 

Grundlagen  der  Geometrie ,  94-100,  304 
number,  257 

postulates  for  plane  Euclidean 
geometry,  95-96 
space,  261 

Hilbert  and  Bernays,  Grundlagen  der 
Mathematik ,  276,  305 
Hindu-Arabic  numeral  system,  49 
Hindus,  custodians  of  mathematics 
during  Dark  Ages,  49 
Hippocrates  of  Chios,  early 

demonstrative  geometry,  13,  32 
Hobson,  E.  W.,  on  the  nature  of 
mathematics  (quoted),  189 
Hopkinson,  J.,  on  the  nature  of 

mathematics  (quoted),  190 
Horn  angle,  121 

Huntington,  E.  V.,  definition  of  point 
and  line,  372 

postulates  for  Boolean  algebra,  244 
treatment  of  Euclidean  geometry, 
99,  122 

Huygens,  C.,  theory  of  probability,  51 
Hyperbolic  functions,  67,  85 
Hyperparallels,  71 
Hypotheses  of  acute,  obtuse,  and 
right  angles,  64 


Idempotent  laws  of  Boolean  algebra, 
245 

Implication,  definition  and  notation, 
277 

material,  278n 
strict,  278 n
truth  table,  279 
Implicit  definition,  91 
Impredicative  definition,  298 
Improper  integrals,  202 
Incompleteness  theorem  of  Godel, 
334-338 
Independence  of  postulates  and  of

postulate  systems,  absolute,  195 
complete,  184 
ordinary,  178-179 
strong,  180 
very,  195 

Independence  of  primitive  terms,  194 
Independent  variable,  264 
Index,  Godel,  338 
Induction,  6 

Inequality,  Schwarz’s,  123 
triangle,  123 
Infinite  group,  140 
Infinite-valued  logic,  291 
Infinitude  of  straight  lines,  44 
Integers,  genetical  development  of, 
217-220 

Integral  domain,  137,  144 
Interpretation  of  postulate  system,  169 
Intersection  of  sets,  241 
Intuitionism,  302-304 
Inverse  proposition,  307 
Inverse  statement,  24 
Inversion,  122 

Investigation  into  the  Laws  of  Thought ,  An 
(Boole),  275 

Irrational  numbers,  224 
Irrationality,  of  π,  67
of  √2,  17

Isogonal  conjugate  lines,  23 
Isomorphism,  219 
Isosceles  birectangle,  64 

Jordan  algebras,  139 
Journal  of  Symbolic  Logic ,  276 

Kantian  theory  of  space,  77

Kepler,  J.,  laws  of  planetary  motion,  51 

Keyser,  C.  J.,  on  the  nature  of 

mathematics  (quoted),  189 
on  propositional  functions,  167 
Kirkman  points,  124 
Kleene,  S.  C.,  Godel’s  theorems,  336 
Klein,  F.,  application  of  groups  to 
geometry,  144 ff


definition  of  a  geometry,  146 
Erlanger  Programm,  146,  258 
model  of  Lobachevskian  geometry,  87 
Kronecker,  L.,  forerunner  of  the 
intuitionist  program,  302 
on  the  whole  numbers  (quoted),  227 
K’ui-ch’ang  Suan-shu,  21
Kurple,  120 


Lagrange,  J.  L.,  attempted  rigorization 
of  analysis,  201 

Theorie  des  fonctions  analytiques ,  201 
Lambert,  J.  H.,  Die  Theorie  der 
Parallellinien ,  66 
hyperbolic  functions,  67 
irrationality  of  π,  67
non-Euclidean  geometry,  66-67 
Landau,  E.,  Foundations  of  Analysis,  217
Langford,  C.  H.,  contributor  to  the 
logistic  program,  301 
Law(s),  absorption,  245 
of  Archimedes,  206 
associative,  129,  245 
cancellation,  132 
commutative,  129 

of  contradiction,  62,  81,  188,  280,  287 
intuitionist  school,  303 
of  contraposition,  280,  286 
De  Morgan’s,  246 
distributive,  129 

of  double  negation,  280,  286,  294 
of  excluded  middle,  62,  81,  188, 

280,  285 

intuitionist  school,  303 
of  exponents,  228 
idempotent,  245 
of  logic,  280 
parallelogram,  290 
of  planetary  motion,  51 
Poretsky’s,  267 

of  sines  in  Lobachevskian  geometry,  70 
of  sines  in  spherical  geometry,  83 
of  the  syllogism,  280,  287 
of  triple  negation,  294 
Least  upper  bound,  205 
Legendre,  A.  M.,  Elements  de  geometrie,  67
first  theorem,  68,  82 
non-Euclidean  geometry,  67-68 
second  theorem,  68,  83 
Leibniz,  G.  W.,  contributor  to  the 
logistic  program,  300 
creation  of  the  calculus,  51,  198,  296 
De  arte  combinatoria,  275
function  concept,  264 
sum  of  an  infinite  series,  201 
Leon,  early  demonstrative  geometry,  32 
Lewis,  C.  I.,  strict  implication,  278 n 
Lie  algebras,  139 
Limit  point,  260,  262 
Lindemann,  C.  L.  F.,  transcendentality 
of  π,  257

Line  coordinates,  127 
Linear  continuum,  205 
Linear  order,  165 

Liouville,  J.,  existence  of  transcendental 
numbers,  326 
numbers,  327 

Lobachevskian  geometry,  71 
consistency  of,  100-105,  110 
Lobachevsky,  N.  I.,  challenging  an 
axiom,  295 

Geometrische  Untersuchungen  zur  Theorie
der  Parallellinien,  71
non-Euclidean  geometry,  70-71 
Pangeometrie ,  71 
Logic(s),  laws  of,  280 

many-valued,  or  m-valued,  291-295
mathematical,  274 
non-Aristotelian,  292 
non-uniqueness  of,  290-291 
other,  290-295 

part  of  pattern  of  a  logical  discourse, 
187-188,  274,  290 
symbolic,  274 
three-valued,  293-295 
(See  also  Calculus  of  propositions 
and  Reasoning) 

Logica  demonstrativa  (Saccheri),  63 
Logical  equivalence,  280 
Logicism,  300-302 
Loop,  160 

left  inverse  property,  160 


right  inverse  property,  160 
Lorentz  geometry,  162 
Lukasiewicz,  J.,  independence  of 

postulates  of  two-valued  logic,  292 
m-valued  logic,  291 
postulates  for  calculus  of 
propositions,  288 
three-valued  logic,  291 ff

MacColl,  H.,  three-valued  logic,  291  n 
MacLane,  S.,  on  modern  algebra 
(quoted),  137-138 
Many-valued,  or  m-valued,  logics, 
291-295 

applied  to  quantum  theory,  292 
Mapping,  93 

Material  axiomatics,  14-16,  169
pattern  of,  15 
Material  implication,  278 n 
Mathematical  Analysis  of  Logic ,  The 
(Boole),  275 

Mathematical  induction,  209 
Mathematical  logic,  274 
Mathematics,  applied,  169 
categories  of,  196-197 
origin  of,  1-2 
pure,  169 

Matric  algebra,  138-139 
Means,  arithmetic,  25 
geometric,  25 
harmonic,  25 
subcontrary,  25 

Menaechmus,  history  of  analytic 
geometry,  105 
Metamathematics,  174,  187 
Metaphysics  (Aristotle),  32 
Method  of  similitude,  20 

Metric  properties,  112 
Metric  space,  circle,  272 
distance  function,  261 
metric  of,  261 
postulates  for,  261 
Metrica  (Heron),  27-28
Michalski,  K.,  early  history  of 
three-valued  logics,  291  n 
Model  of  a  postulate  system,  169
concrete,  175 
ideal,  175 

Modern  compasses,  53,  318 
Moore,  E.  H.,  complete  independence, 
184 

Hilbert’s  postulate  system  for 
Euclidean  geometry,  121 
Moore,  R.  L.,  postulate  set  for 

set-theoretic  topology,  180 
Veblen’s  revised  postulate  system  for 
Euclidean  geometry,  99 
Moscow  papyrus,  4,  20-21 
Mystic  hexagram  theorem,  112-113, 
114,  124 

Napier,  J.,  invention  of  logarithms,  51 
Napoleon,  confiscation  of  Italian 
manuscripts,  33 

Nasir-ed-din,  “proof”  of  Euclid’s 
parallel  postulate,  61 
Natural  number  system,  207-216 
postulates  for,  208-209,  216 
redundancies,  314-317 
Nature  of  mathematics,  E.  T.  Bell 
(quoted),  77 

Cantor’s  aphorism,  77,  295 
B.  Peirce  (quoted),  170 
H.  Poincare  (quoted),  170 
quotations,  189-190 
B.  Russell  (quoted),  170 
Necessary  condition,  307 
Necessary  and  sufficient  condition,  307 
Negation,  definition  and  notation,  277 
truth  table,  279,  294 
Neighborhoods,  259 

Neumann,  J.  von,  application  of  many-valued
logics  to  quantum  theory,  292

collaborator  with  Hilbert  on 
formalist  program,  304 
reconstruction  of  set  theory,  298 
Newton,  I.,  creation  of  the  calculus, 

51,  198,  296 
Quadrature  of  Curves,  199 


Nicod,  J.,  postulates  for  calculus  of 
propositions,  288 
Non-Archimedean  geometry,  79 
Non-Archimedean  systems,  121-122 
Non-Aristotelian  logics,  292 
Non-Desarguesian  geometry,  126 
Non-Euclidean  geometry,  58-79 
consistency  of,  73-79 
discovery  of,  69-73 
Kantian  theory  of  space,  77 
significance  of,  76-77 
Non-Riemannian  geometries,  79 
Nuclear  propositions,  91 
Null  set,  240 

Number(s),  algebraic,  255,  322,  326 
constructible,  322 
e,  257 
Godel,  337 
Hilbert’s,  257 
Liouville,  327 
perfect,  53 
π,  2,  19,  67,  257
transcendental,  255,  326 
transfinite,  252-257 
Number  field,  133,  319 

square  root  extension  of,  319 
square  root  extension  chain,  319 

On  Floating  Bodies  (Archimedes),  48 
On  Plane  Equilibriums  (Archimedes), 

47,  57 

On  the  Sphere  and  Cylinder  (Archimedes), 
48 

One-to-one  correspondence,  249 
Open  sets,  263 
Open  sphere,  262 

Operation(s),  closure  property,  153 
commutative,  154 
definition  of,  128 
well  defined,  164 
Ordered  field,  133 
complete,  205 

postulates  for,  204-205 
positive  and  negative  elements  of,  133 
postulates  for,  132-133 
theorems,  155 
Oresme,  N.,  history  of  analytic
geometry,  105 

Orthocentric  tetrahedron,  22 
Oughtred,  W.,  algebraic  notation,  51 

Padoa,  A.,  independence  of  primitive 
terms,  194 

Pangeometrie  (Lobachevsky),  71 
Pappus,  Greek  geometer  of  antiquity,  48 
Paradox(es),  54-55,  229-233,  236 
Banach-Tarski,  331 
barber,  297 
Burali-Forti’s,  296 
Cantor’s,  296-297 
of  Epimenides,  298 
of  Eubulides,  298 
geometrical,  54-55 
in  logic,  309-310 
Russell’s,  297-298,  309 
in  set  theory,  296-298 
Parallax  of  star,  88 
Parallelogram  law  of  forces,  290 
Partial  order,  165 
Particular  affirmative,  23 
Particular  negative,  23 
Pascal,  B.,  French  seventeenth-century 
mathematician,  51 
line,  124 

mystic  hexagram  theorem,  112-113, 
114,  124 

Traite  du  triangle  arithmetique ,  210 n 
Pasch,  M.,  postulate,  46,  56,  95,  97 
treatment  of  Euclidean  geometry, 
90-92 

Pattern  of  formal  axiomatics,  168 
Pattern  of  material  axiomatics,  15 
Peacock,  G.,  foundations  of  algebra,  131 
Peano,  G.,  contributor  to  development 
of  logistic  program,  301 
Formulaire  de  mathematiques,  276
genetical  development  of  real 
number  system,  208 
postulates  for  natural  number 
system,  216 

treatment  of  Euclidean  geometry, 
92-93 


Peirce,  B.,  father  of  C.  S.  Peirce,  275 
on  the  nature  of  mathematics 
(quoted),  170 

Peirce,  C.  S.,  on  the  nature  of 

mathematics  (quoted),  189 
symbolic  logic,  275 
Perfect  numbers,  53 
Peyrard,  F.,  tenth-century  copy  of 
Euclid’s  Elements,  33 
Philolaus,  geometrical  harmony,  25 
Philosophies  of  mathematics,  299-305 
π,  Babylonian  value,  2
Bible  value,  2 n 
Egyptian  value,  2 
Hindu  value,  19 
irrationality  of,  67 
transcendentality  of,  257 
Pieri,  M.,  definition  of  perpendicularity, 
371 

postulates  for  Euclidean  geometry, 

93,  120 

treatment  of  Euclidean  geometry, 
93-94 

Plato,  influence  on  mathematics,  13-14 
mentioned  by  Proclus,  31 
Pythagorean  triples,  26 
Plato’s  academy,  14,  31,  32 
Playfair’s  postulate,  60,  96,  97 
Plücker,  J.,  German  geometer,  112
coordinates  of  a  line,  127 
equation  of  a  point,  127 
lines,  124 

proof  of  principle  of  duality,  126-127 
Poincare,  H.,  forerunner  of  the 
intuitionist  program,  302 
imaginary  universe,  88-89 
impredicative  definitions,  298 
model,  76,  101-105,  123 
on  the  nature  of  mathematics 
(quoted),  170 
Points,  addition  of,  158 
Poles  and  polars,  115-116 
Poncelet,  V.,  principle  of  duality,  115 
projective  geometry,  112 
Poretsky’s  law,  267 
Post,  E.  L.,  m-valued  logic,  291 
three-valued  logic,  294 
Postulate(s),  of  Archimedes,  48,  57,

96,  97 

of  completeness,  96 

of  congruence,  96 

of  connection,  95 

of  continuity,  44,  55,  96,  205-207 

of  Dedekind,  55,  98 

definition  of,  168 

of  finite  induction,  209,  216 

independence  of,  195 

of  order,  95 

origin  of,  186 

of  parallels  (Euclid’s  fifth),  41, 

58-61,  96 
of  Pasch,  56,  95 
of  Playfair,  96,  97 
properties  of,  173-183 
of  Zermelo,  329 

Postulate  system(s),  abas  and  dabas,  87 
bees  and  hives,  192 
Boolean  algebra,  244 
commutative  (or  Abelian)  ring,  137 
complete  ordered  field,  204-205 
cyclic  order,  192 
division  ring,  137 
equivalence  relation,  152,  195 
Euclid’s,  39 
field,  132 

redundancies,  313-314 
a  finite  geometry,  193 
group,  140,  158
Hausdorff  space,  259 
Hilbert’s,  95-96 
integral  domain,  137 
Lobachevskian  geometry,  100 
m-classes,  193 
metric  space,  261 

natural  number  system,  208-209,  216 
independence  of,  317 
redundancies,  314-317 
ordered  field,  132-133 
partial  order,  165 
Pieri’s,  93 

plane  projective  geometry,  116,  126 
properties  of,  173-183 

absolute  independence,  195 
categoricalness,  181-182 


completeness,  180-183 
consistency,  175-178 
equivalence,  174-175 
independence,  178-179 
relative  weakness  of,  184 
strong  independence,  180 
verifiable,  178 
very  independent,  195 
weak  completeness,  181 
propositional  calculus,  283 
real  number  system,  204-205 
ring,  137 

ring  with  unity,  137 
semigroup,  140 
sfield,  137 

simple,  or  linear,  order,  165,  173,  328 
topological  space,  263 
well-ordered  set,  328 
Power  of  inversion,  122 
Pre-Hellenistic  mathematics,  nature 
of,  1-6 
Premises,  7 

Prime  numbers,  infinitude  of,  53 
Primitive  tautologies,  283 
Primitive  terms,  167 
Principal  curvatures,  85 
Principia  mathematica  (Whitehead  and 
Russell),  187-188,  249,  276,  282, 
288,  292,  301-302,  303 
Principle,  of  addition,  283 
of  duality,  114 ff

Boolean  algebra,  245 
calculus  of  propositions,  289 
partially  ordered  sets,  165 
Plücker’s  proof,  126-127
poles  and  polars,  116 
postulational  approach,  116 
spherical  geometry,  126 
trigonometry,  126 
of  mathematical  induction,  209 
second,  215
third,  215 

of  permanence  of  equivalent 
forms,  131 
of  permutation,  283 
of  smallest  natural  number,  214 
of  summation,  283 
of  tautology,  283
vicious  circle,  298 

Proclus,  Commentary  on  Euclid,  Book  I,
12,  30,  36,  61

on  definitions,  axioms,  and  postulates 
(quoted),  36-37 
on  elements  (quoted),  31-32 
about  Euclid  (quoted),  30-31 
on  Euclid’s  fifth  postulate 
(quoted),  59 
Eudemian  Summary ,  12 
Projective  geometry,  111-118 
definition  of,  114 
finite,  117 

poles  and  polars,  115-116 
postulates  for,  116,  126 
principle  of  duality,  114 ff
projecting  a  line  to  infinity,  125 
Proof  theory,  305 
Proper  subset,  241 
Proposition,  contrapositive,  307 
converse,  307 
definition  of,  276 
doubtful,  295 
inverse,  307 
undecidable,  294 
Propositional  function,  166-167 
Pseudosphere,  75 
Ptolemy,  C.,  Almagest ,  50,  60 

Greek  mathematician  of  antiquity,  48 
Ptolemy,  King,  royal  road  in 
geometry,  110 n
Pure  mathematics,  169 
example  of,  170-172 
Pythagoras,  Greek  mathematician  of 
antiquity,  12-13,  295 
Pythagorean  school,  13 
Pythagorean  theorem,  24 
metrical  aspect  of,  112 
Pythagorean  triples,  26 

Quadrature  of  Curves  (Newton),  199 
Quaternions,  definition  of,  136 
multiplication  table,  136 
units,  136 


Quine,  W.  V.,  contributor  to  the 
logistic  program,  301 
logic,  186 

postulates  for  calculus  of 
propositions,  288 

Ramsey,  F.  P.,  contributor  to  the 
logistic  program,  301 
Range  of  values,  264 
Rational  cut,  223 
Rational  numbers,  density  of,  253 
denumerability  of,  254,  338 
genetical  development  of,  220-221 
Real  number  system,  196-228 
Archimedean  law,  206 
Dedekind’s  theorem  of  continuity,  206 
postulational  treatment  of,  203-207 
significance  of,  196-203 
Real  numbers,  genetical  development 
of,  222-225 

nondenumerability  of,  255 
Reasoning,  by  analogy,  6 
deductive,  6-9 
empirical,  2-5 
inductive,  6 
valid  and  invalid,  7 
Reductio  ad  absurdum,  61-63,  285 
considered  as  a  gambit,  63 
most  general  form,  287 
Regular  heptagon,  Heron’s 

approximate  construction,  28 
Reichenbach,  H.,  application  of  many-valued
logic  to  quantum  theory,  292

infinite-valued  logic,  291 
Relations,  148-153 
antisymmetric,  165 
asymmetric,  150 
determinate,  or  connected,  164 
distinctly  transitive,  165 
equivalence,  152 
intransitive,  151 
irreflexive,  150 
nonreflexive,  150 
nonsymmetric,  150 
nontransitive,  151 
Transform,  161
Transformation (s),  93 
group,  146 
identical,  145 
inverse,  145 
involutoric,  161 
nonsingular,  144 
product  of,  145 
Translations,  140 
Triangle  inequality,  123 
Trigonometry,  principle  of  duality,  126 
Trirectangle,  66 
Trisection  of  an  angle,  324 
Truth  tables,  conjunction,  279 
definition  of,  279 
disjunction,  279 
equivalence,  279 
implication,  279 
negation,  279 

in  three-valued  logic,  293-294 
Type-I  sequential  independence,  194 
Type-II  sequential  independence,  194 

Uber  die  Hypothesen  welche  der  Geometrie 
zu  Grunde  liegen  (Riemann),  45 
Undecidable  propositions,  294 
Union  of  sets,  241 
Universal  affirmative,  23 
Universal  negative,  23 
Universe,  241 

University  of  Alexandria,  31 
Upper  bound,  205 
least,  205 

Valid  and  invalid  reasoning,  7 
Veblen,  O.,  postulational  treatment  of 
Euclidean  geometry,  98 
Vector  analysis,  157-158 
Vector  product,  158 
Venn  diagrams,  242-243 
Very  independent  postulates,  195 
Vicious  circle  principle,  298 


Visual  space,  non-Euclidean,  78 

Vorlesungen  uber  die  Algebra  der  Logic 
(Schroder),  275 

Wallis,  J.,  “proof”  of  parallel 
postulate,  61 

Was  sind  und  was  sollen  die  Zahlen?
(Dedekind),  222 

Weakly  complete  postulates,  181 

Weierstrass,  K.,  arithmetization  of 
analysis,  202,  296 
continuous  nondifferentiable 
function,  202 

Weyl,  H.,  on  impredicative 
definitions,  299 

Whitehead,  A.  N.,  logistic  program, 
301-302 

on  the  nature  of  mathematics 
(quoted),  189 
theory  of  types,  186 

Whitehead  and  Russell,  Principia  mathematica,
187-188,  249,  276,  282,
288,  292,  301-302,  303

Wilder,  R.  L.,  dependence  of  R.  L. 
Moore’s  sixth  postulate,  180 

William  of  Occam,  three-valued 
logics,  291 n

Wittgenstein,  L.,  contributor  to  the 
logistic  program,  301 

Young,  J.  W.,  on  the  nature  of 

mathematics  (quoted),  189 

Zeno,  foreshadowing  the  second  crisis  in 
the  foundations  of  mathematics, 
296n 

Zermelo,  E.,  postulate  of,  329 
reconstruction  of  set  theory,  298 
well-ordering  theorem,  328 

Zwicky,  F.,  application  of  many-valued 
logic  to  quantum  theory,  292 